Key job responsibilities - Design complex data collections with human participants in response to science needs: author instructions, define and implement quality targets and mechanisms, provide day-to-day coordination of data collection efforts (including planning, scheduling, and reporting), and be responsible for the final deliverables - Design and conduct complex data creation tasks using synthetic and model-based data generation methods, following state-of-the-art approaches - Analyze and extract insights from large amounts of data - Build tools or tool prototypes for data analysis or data creation, using Python or another scripting language - Use modeling tools to bootstrap or test new AI functionalities - Collaborate with scientists, software engineers, and other data creators to evaluate performance of AI models About the team Amazon strives to be the world's most customer-centric company, where customers can research and purchase anything they might want online or offline. We are looking for Language Engineers to join our science and engineering team to support the development of complex, multimodal datasets, using a range of approaches including synthetic data generation, model-supported data generation, and human-in-the-loop data collections.