What are the responsibilities and job description for the Student Researcher – Machine Learning, Internship position at Jobright.ai?
Job Summary:
ByteDance is a leading technology company dedicated to pioneering advanced AI foundation models. They are seeking a Student Researcher for 2025 to research and develop machine learning systems, focusing on heterogeneous computing architecture and optimizing AI algorithms and hardware for machine learning.
Responsibilities:
• Research and develop our machine learning systems, including heterogeneous computing architecture, management, scheduling, and monitoring.
• Manage cross-layer optimization of system and AI algorithms and hardware for machine learning (GPU, ASIC).
• Implement both general-purpose training framework features and model-specific optimizations (e.g. LLMs, diffusion models).
• Improve efficiency and stability for extremely large-scale distributed training jobs.
Qualifications:
Required:
• Currently enrolled in a PhD program, with a grounding in distributed and parallel computing principles and knowledge of recent advances in computing, storage, networking, and hardware technologies.
• Familiar with machine learning algorithms, platforms, and frameworks such as PyTorch and JAX.
• Have a basic understanding of how GPUs and/or ASICs work.
• Expert in at least one or two programming languages in a Linux environment: C/C++, CUDA, Python.
• Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment.
Preferred:
• GPU-based high-performance computing and RDMA high-performance networking (MPI, NCCL, ibverbs).
• Distributed training framework optimizations such as DeepSpeed, FSDP, Megatron, GSPMD.
• AI compiler stacks such as torch.fx, XLA and MLIR.
• Large scale data processing and parallel computing.
• Experience in designing and operating large-scale systems in cloud computing or machine learning.
• Experience in in-depth CUDA programming and performance tuning (CUTLASS, Triton).
Company:
ByteDance is a technology company that develops content creation platforms and services. Founded in 2012, the company is headquartered in Beijing, China, and has more than 10,000 employees. It is currently a late-stage company.