Public summary
Join a fast-growing AI company based in Munich aiming to remove barriers to learning via innovative text-to-speech technology. The role focuses on data collection and ingestion pipeline management using cloud infrastructure and supports model training operations. Work remotely within a diverse, entrepreneurial team committed to impactful products for millions of users globally.
Responsibilities
Identify and integrate new audio data sources into the ingestion pipeline; operate and enhance cloud infrastructure on GCP using Terraform; collaborate with AI scientists to optimize data quality, scale, and cost; contribute to the AI team's dataset strategy for advancing consumer and enterprise products.
Qualifications
Bachelor’s or higher degree in Computer Science or related field; over 5 years of software development experience; proficiency in bash and Python scripting in Linux; expertise in Docker and Infrastructure-as-Code; experience with major cloud platforms (preferably GCP); knowledge of web crawlers and large-scale data workflows is a plus; strong multitasking and communication skills.