Public summary
A remote freelance role for an experienced Python Data Scraping Engineer to handle end-to-end web data extraction and processing for AI-related projects. The position involves working with AI and human hybrid systems, utilizing tools like Apify and OpenRouter, to ensure accurate and reliable structured datasets from complex dynamic websites. Candidates should have strong Python scraping experience, data expertise, and an independent work ethic, with proficiency in English at least B2 level.
Salary
USD 37.00 - 37.00 hour
Responsibilities
Own comprehensive data scraping workflows across complex and dynamic websites, ensuring accuracy and structured delivery. Utilize internal tools and custom methods to accelerate data extraction and validation. Adapt techniques to handle JavaScript-rendered and interactive content. Maintain data quality through rigorous validation and consistency checks. Scale operations efficiently while monitoring for failures and adapting to site changes.
Qualifications
Minimum 3 years of experience in data engineering, web scraping, automation, or software development. Strong proficiency in Python web scraping libraries (BeautifulSoup, Selenium) and handling dynamic content and APIs using proxies. Proven skills in extracting data from complex hierarchical and inconsistent HTML structures. Experience in data cleaning, normalization, validation, and delivering datasets in various formats (CSV, JSON, Google Sheets). Hands-on experience with large language models and AI frameworks to support automation. Self-directed with strong attention to detail. Upper-intermediate English proficiency (B2+) required. Bachelor’s or Master’s degree in relevant technical fields is a plus. GitHub link is advantageous.