Public summary
A freelance part-time remote role is available for an experienced Python Data Scraping Engineer to join a hybrid AI and human collaboration project. The role involves developing and managing specialized data extraction workflows using tools such as Apify and OpenRouter along with custom solutions, focusing on extracting and validating structured data from complex and dynamic web sources. Candidates should have at least 3 years of relevant experience, strong Python web scraping skills including handling dynamic content, and solid data processing expertise. English proficiency at an upper-intermediate level or above is required. The position offers flexible remote work with performance-based compensation up to $37 per hour equivalent.
Salary
USD 37.00 - 37.00 hour
Responsibilities
Own end-to-end data extraction workflows across complex websites, assuring accuracy and reliable delivery of structured datasets. Use internal tools and custom workflows to collect, validate, and execute data scraping tasks meeting project requirements. Adapt scraping methods to handle dynamic and interactive web content such as JavaScript-rendered pages. Implement data quality standards including validation checks and consistency controls before delivery. Scale operations for large datasets with efficient batching or parallel processing, monitor for failures, and maintain workflow stability despite site changes.
Qualifications
Minimum 3 years of experience in data engineering, web scraping, automation, or software development. Bachelor's or Master's degree in Engineering, Applied Mathematics, Computer Science, or related fields is a plus. Strong expertise in Python web scraping libraries and frameworks (BeautifulSoup, Selenium), including handling dynamic and API-driven content through proxies. Proven ability to extract data from complex and inconsistent web structures. Experience with data cleaning, normalization, validation producing structured outputs like CSV, JSON, or Google Sheets. Hands-on experience with large language models (LLMs) and AI frameworks to support automation. Excellent attention to detail, self-directed problem-solving capability, and proficiency in English at upper-intermediate level (B2) or above. A GitHub portfolio is advantageous.