Senior Data Scraping Engineer (Python)

Confidential company · Hamburg · Posted May 14, 2026

Public summary

A remote freelance role for an experienced Python data scraping engineer who will handle end-to-end web data extraction and processing for AI-related projects. The position involves working with hybrid AI and human systems and using tools such as Apify and OpenRouter to deliver accurate, reliable structured datasets from complex dynamic websites. Candidates should have strong Python scraping experience, data expertise, and an independent work ethic, plus English proficiency at B2 level or higher.

Salary

USD 37.00 per hour

Responsibilities

Own comprehensive data scraping workflows across complex and dynamic websites, ensuring accuracy and structured delivery. Utilize internal tools and custom methods to accelerate data extraction and validation. Adapt techniques to handle JavaScript-rendered and interactive content. Maintain data quality through rigorous validation and consistency checks. Scale operations efficiently while monitoring for failures and adapting to site changes.

Qualifications

Minimum 3 years of experience in data engineering, web scraping, automation, or software development. Strong proficiency in Python web scraping libraries (BeautifulSoup, Selenium), and experience handling dynamic content, APIs, and proxies. Proven skills in extracting data from complex, hierarchical, and inconsistent HTML structures. Experience in data cleaning, normalization, and validation, and in delivering datasets in various formats (CSV, JSON, Google Sheets). Hands-on experience with large language models and AI frameworks to support automation. Self-directed with strong attention to detail. Upper-intermediate English proficiency (B2+) required. A Bachelor’s or Master’s degree in a relevant technical field is a plus. A link to a GitHub profile is advantageous.

Skills

Python, web scraping, BeautifulSoup, Selenium, JavaScript, AJAX, API integration, data extraction, data cleaning, data validation, automation, software development, large language models, AI frameworks, proxy usage