Deep Learning System Optimization Engineer

5 months old

This job is no longer active

Build the future of data. Join the Snowflake team.

We are seeking talented System Developers and Researchers to join our team and work on GPU kernel optimizations and deep learning system optimizations. Our focus is on building the most efficient and scalable deep learning systems. This is an exciting opportunity to collaborate with a strong team, including founding members of DeepSpeed, vLLM, and TensorFlow, and to contribute to the development of Snowflake Arctic, the Snowflake foundation model family. Together, we will innovate, build, and advance the state of the art in deep learning systems.

RESPONSIBILITIES:

  • Analyze and optimize GPU kernel performance for deep learning models.
  • Develop and implement strategies to enhance the efficiency and scalability of deep learning systems.
  • Profile and benchmark deep learning systems to identify performance bottlenecks.
  • Design and implement optimizations to reduce latency and improve resource utilization for training and inference.
  • Build a strong system foundation for Snowflake Arctic by interacting with model scientists for model-system co-development.
  • Stay updated with the latest advancements in GPU kernel optimization, deep learning, and LLM system development.
  • Publish innovations, optimizations, and engineering practices in technical blogs and at top-tier conferences and journals.

REQUIREMENTS:

  • Bachelor’s degree in Computer Science, Electrical Engineering, or a related field. A Master’s degree or PhD is preferred.
  • 5 years of experience in GPU kernel optimization, deep learning system optimization, or high-performance computing (HPC).
  • Proficiency in programming languages such as C/C++ and Python.
  • Strong understanding of GPU architectures and experience with CUDA or similar frameworks.
  • Experience with profiling tools (e.g., nvprof, Nsight) and performance analysis methodologies.
  • Solid problem-solving skills and ability to debug complex performance issues.
  • Experience with version control systems (e.g., Git) and collaborative development practices.
  • Excellent communication skills and ability to work effectively in a cross-functional team environment.

Join us in optimizing deep learning systems and pushing the boundaries of AI efficiency. Apply now to be part of our dynamic and pioneering team!

Every Snowflake employee is expected to follow the company’s confidentiality and security standards for handling sensitive data. Snowflake employees must abide by the company’s data security plan as an essential part of their duties. It is every employee’s duty to keep customer information secure and confidential.