Public summary
Seeking an experienced Python Data Scraping Engineer to work remotely on a freelance basis, focusing on complex web extraction and data processing tasks leveraging a hybrid AI and human system. The role involves developing and maintaining robust data scraping workflows using tools such as Apify and OpenRouter, ensuring high-quality, structured data delivery from dynamic web sources. Candidates should have strong Python skills, experience with dynamic content scraping, data cleaning, and validation, and be comfortable working independently in English.
Salary
USD 37.00 - 37.00 hour
Responsibilities
Manage end-to-end data extraction processes from complex websites, ensuring accuracy and completeness of structured datasets. Utilize internal and custom tools to enhance scraping efficiency and data validation. Adapt methods to handle dynamic and interactive web content, enforce data quality standards through rigorous checking, and scale operations for large datasets while maintaining stability and monitoring for failures.
Qualifications
Minimum three years of experience in data engineering, web scraping, automation, or software development. Proficient in Python-based web scraping frameworks such as BeautifulSoup and Selenium, with experience handling dynamic JavaScript-driven content and APIs. Skilled in data cleaning, normalization, and preparing datasets in formats like CSV, JSON, or Google Sheets. Experience with large language models and AI tools is advantageous. Upper-intermediate or higher English proficiency (B2+) is required. Bachelor's or Master's degree in relevant technical fields is a plus. A GitHub portfolio is an advantage.