What are the responsibilities and job description for the Mid-Level Software Engineer - Post-training Data position at Jobright.ai?
Job Summary:
Magic is an AI coding startup that enables developers to work with AI to find code for building apps. As a Software Engineer working on post-training data infrastructure, you will write efficient data pipelines and collaborate with research and applied teams to gather and maintain diverse datasets at scale.
Responsibilities:
• Develop creative ways to obtain post-training datasets that teach the model targeted capabilities.
• Develop creative ways to reliably generate synthetic datasets.
• Iterate on filtering and scoring heuristics for post-training datasets.
• Contribute to our data infrastructure by implementing, maintaining, and testing data pipelines that serve workloads at scales from gigabytes to hundreds of petabytes.
Qualifications:
Required:
• Extreme attention to detail and commitment to data quality.
• Ability to write reliable, well-tested code and quickly learn new tools, programming languages or frameworks needed for a given job.
• Versatility in end-to-end data pipelines, from scraping to processing.
• High intellectual agility and grit to tackle tough challenges.
Company:
Founded in 2022, Magic is headquartered in San Francisco, California, USA, and has a team of 2-10 employees. The company is currently at the growth stage and has a track record of offering H-1B sponsorships.