Product Reliability Engineer

See more jobs from Palantir Technologies Inc

over 5 years old

Apply Now

A World-Changing Company

Palantir builds the world’s leading software for data-driven decisions and operations. By bringing the right data to the people who need it, our platforms empower our partners to develop lifesaving drugs, forecast supply chain disruptions, locate missing children, and more.

The Role

Product Reliability Engineers (PREs) are the driving forces of stability across Palantir’s products. Product Reliability Engineers help to ensure our products are available 24/7. When something goes wrong, Product Reliability Engineers are the first to respond and are responsible for triaging, troubleshooting, and coordinating the resolution of the issue.

Every day at Palantir is different: we’re constantly evolving to better respond to customer needs, and as a PRE you will embed with our engineering and business teams to minimize risks associated with the deployment of our products. You are a resourceful, creative, and agile problem solver who is able to work both collaboratively and independently to resolve the most difficult and nebulous technical issues. This includes creating product health metrics and automated alerts, fixing product bugs, and developing and documenting strategies for responding to incidents.

Whatever the technical issue or question about Palantir’s products is, you’ll play a central and critical role in resolving it — seeking not just a one-time fix, but a permanent solution. 

Core Responsibilities

  • Develop a deep understanding of Palantir's products and processes.
  • Collaborate with customer-facing, product, and infrastructure teams on the development and deployment of scalable, reliable software for our customers.
  • Diagnose, resolve, and prevent issues encountered in the field
  • Reduce the operational overhead of Palantir’s products and leverage data to understand the largest sources of reliability risk.
  • Deliver end-to-end improvements to stability by proactively preventing issues via telemetry and automation and directly reducing the need for reactive support.
  • Make data-driven decisions about investments in stability and reliability.
  • Take part in a 24/7 on-call rotation responsible for coordinating Palantir’s response to mission-critical incidents, ensuring efficient resolution with minimal customer impact.
  • What We Value

  • Excellent problem solving skills, ability to break down and explain complex concepts, and strong attention to detail.
  • Comfort working in a fast moving environment with dynamic objectives that require creative thinking to address product and customer needs.
  • Ability to work both independently and make decisions under minimal supervision, as well as collaborate as part of a team.
  • Experience coding with Java, Go and/or web technologies (e.g. HTML, CSS, JavaScript, Python/Ruby, Django/Flask/Ruby on Rails, etc.) is a plus.
  • Experience with distributed computing systems and/or cloud infrastructures (e.g. Spark, Hadoop, YARN, Kubernetes, AWS, etc.) is a plus.
  • What We Require

  • Background in Computer Science, Engineering, Information Systems, or other technical field.
  • Willingness and interest to travel to other Palantir locations as needed.
  • Life at Palantir

    We want every Palantirian to achieve their best outcomes, that’s why we celebrate individuals’ strengths, skills, and interests, from your first interview to your longterm growth, rather than rely on traditional career ladders. Paying attention to the needs of our community enables us to optimize our opportunities to grow and helps ensure many pathways to success at Palantir. Promoting health and well-being across all areas of Palantirians’ lives is just one of the ways we’re investing in our community. Learn more at Life at Palantir and note that our offerings may vary by region.

    In keeping consistent with Palantir’s values and culture, we believe employees are “better together” and in-person work affords the opportunity for more creative outcomes. Therefore, we encourage employees to work from our offices to foster connectivity and innovation. Many teams do offer hybrid options (WFH a day or two a week), allowing our employees to strike the right trade-off for their personal productivity. Based on business need, there are a few roles that allow for “Remote” work on an exceptional basis. If you are applying for one of these roles, you must work from the city and or country in which you are employed. If the posting is specified as Onsite, you are required to work from an office.

    If you want to empower the world's most important institutions, you belong here. Palantir values excellence regardless of background. We are committed to making the application and hiring process accessible to everyone and will provide a reasonable accommodation for those living with a disability. If you need an accommodation for the application or hiring process, please reach out and let us know how we can help.