Public summary
Looking for an experienced Python Data Scraping Engineer to work remotely on specialized data extraction and processing projects within a hybrid AI and human system. The role involves developing and maintaining complex web scraping workflows using tools like Apify and OpenRouter, handling dynamic web content, ensuring data quality and validation, and scaling extraction operations. This is a freelance, part-time opportunity with flexible scheduling and competitive hourly compensation up to $37 depending on contribution level.
Salary
Up to USD 37.00 per hour
Responsibilities
Develop end-to-end data extraction workflows to collect structured datasets from complex and dynamic websites. Utilize both internal tools and custom methods to accelerate data collection and validation. Handle extraction challenges including JavaScript-rendered content and changing site structures. Enforce strict data quality standards with validation and cross-checks. Scale scraping operations efficiently while maintaining stability and monitoring failures.
Qualifications
Minimum 3 years of experience in data engineering, web scraping, automation, or software development. Strong proficiency in Python web scraping libraries such as BeautifulSoup and Selenium, including handling dynamic content and working with proxies and APIs. Experience extracting data from complex structures, with solid skills in data cleaning, normalization, and validation, and in delivering structured outputs such as CSV and JSON. Hands-on experience with large language models and AI frameworks is preferred. English proficiency at upper-intermediate (B2) level or above is required. A Bachelor's or Master's degree in Engineering, Applied Mathematics, Computer Science, or a related field is a plus. A GitHub profile is advantageous.