Staff Software Engineer, Diagnostics

about 2 months old

MongoDB’s mission is to empower innovators to create, transform, and disrupt industries by unleashing the power of software and data. We enable organizations of all sizes to easily build, scale, and run modern applications by helping them modernize legacy workloads, embrace innovation, and unleash AI. Our industry-leading developer data platform, MongoDB Atlas, is the only globally distributed, multi-cloud database and is available in more than 115 regions across AWS, Google Cloud, and Microsoft Azure. Atlas allows customers to build and run applications anywhere—on premises, or across cloud providers. With offices worldwide and over 175,000 new developers signing up to use MongoDB every month, it’s no wonder that leading organizations, like Samsung and Toyota, trust MongoDB to build next-generation, AI-powered applications.

MongoDB is seeking highly technical candidates for a role on the Server Triage and Release team as a Staff Diagnostics Engineer. Server Triage and Release is a support- and diagnostics-focused group within the engineering organization.

In this role, you'll continually improve our diagnostic processes by contributing to the applications and tools that support them. You'll also have the opportunity to lead investigations into some of our most challenging defects, collaborating with software engineers across the organization as well as senior technical services engineers.

An ideal candidate has experience writing high-quality application code, can communicate complex technical concepts well, and has great diagnostic intuition for complex distributed architectures.

This role can be based remotely out of Canada.

What you’ll do

  • Lead complex projects to improve our ability to identify and respond to issues
  • Write code to expand our diagnostic toolset, including contributing to and evolving the Electron apps our team developed to investigate diagnostic data
  • Work with the Engineering and Technical Services teams to reproduce and debug issues reported by MongoDB users, escalating problems as needed
  • Advocate for a user-oriented perspective, advise on possible solutions, and help MongoDB users understand complex technical issues and their options to mitigate or resolve them

Ideally, you will have

  • 7+ years of experience in software development, with a focus on data management systems.
  • Expertise in diagnosing thorny technical issues central to databases: distributed systems, consensus algorithms, data replication, query optimization, data storage, OS internals, concurrency and scheduling, networking, etc.
  • Experience using and interpreting the results of standard profiling and debugging tools, such as perf, eBPF, or gdb.
  • Experience supporting production environments, and/or working directly with end-users to investigate and diagnose technical issues
  • Ability to:
    • Design and steer full-stack projects, preferably in TypeScript, Python, or Go
    • Champion code quality and software design best practices
    • Quickly grok and clearly synthesize implications of system behavior
    • Read and understand the intent of code and stack traces in many languages, especially C++
  • Excellent communication skills (both written and verbal) as you will be working with users from all over the world with very diverse backgrounds, as well as with a highly technical engineering team.

Success Measures

In 3 Months:

  • You are comfortable handling tickets with identifiable diagnostic signatures
  • You can identify user-reported tickets that lack sufficient diagnostic detail
  • You have made contributions to our diagnostic tools
  • You are advising on metrics to use for assessing the health of a rollout 

In 6 Months:

  • You are mentoring members of the team on diagnostics and code quality 
  • You have made significant contributions to our diagnostic tools
  • You are acting as a representative on internal rollout committees 
  • You can speak to strengths and weaknesses in product diagnostic capabilities
  • You can lead the response to some escalations from our experienced Technical Services Team

In 12 Months:

  • You are running a project to improve our diagnostic tools
  • You have a deep understanding of one or more Server components
  • You are consulting internally within MongoDB on efforts to adopt and leverage additional diagnostics such as OpenTelemetry 

To drive the personal growth and business impact of our employees, we’re committed to developing a supportive and enriching culture for everyone. From employee affinity groups, to fertility assistance and a generous parental leave policy, we value our employees’ wellbeing and want to support them along every step of their professional and personal journeys. Learn more about what it’s like to work at MongoDB, and help us make an impact on the world!

MongoDB is committed to providing any necessary accommodations for individuals with disabilities within our application and interview process. To request an accommodation due to a disability, please inform your recruiter.

MongoDB is an equal opportunities employer.