Senior DevOps Engineer (Monitoring - Grafana, Prometheus)

See more jobs from Binance

9 months old

Apply Now

Binance is a leading global blockchain ecosystem behind the world’s largest cryptocurrency exchange by trading volume and registered users. We are trusted by over 230 million people in 100+ countries for our industry-leading security, user fund transparency, trading engine speed, deep liquidity, and an unmatched portfolio of digital-asset products. Binance offerings range from trading and finance to education, research, payments, institutional services, Web3 features, and more. We leverage the power of digital assets and blockchain to build an inclusive financial ecosystem to advance the freedom of money and improve financial access for people around the world.

Responsibilities:

  • Design, implement, and manage comprehensive monitoring solutions to ensure high availability, performance of our microservices infrastructure and applications.
  • Utilize advanced monitoring tools and scripting to automate the monitoring of our cloud environments, focusing on AWS.
  • Develop and maintain robust logging and alerting mechanisms to identify and mitigate potential issues proactively.
  • Collaborate with infra team to integrate monitoring solutions into the CI/CD pipeline, ensuring seamless deployments and operations.
  • Conduct performance analysis, capacity planning, and scalability testing to ensure our systems meet current and future demands.
  • Lead incident response and troubleshooting efforts, utilizing monitoring data to quickly resolve operational issues.
  • Requirements:

  • Minimum of 5 years of hands-on experience with Kubernetes, Elasticsearch, Promtheus, Grafana and AWS, with a strong emphasis on monitoring and observability in cloud-native environments.
  • Proficiency in programming languages (such as Python, Go or Rust) for automation of monitoring tasks.
  • Experience with infrastructure as code (IaC) tools, and strong understanding of CI/CD principles, including experience with Docker and Kubernetes for container orchestration.
  • Deep knowledge monitoring tools (such as Prometheus, Grafana or ELK stack) and strategies for large-scale environments.
  • Proven track record in managing and troubleshooting large-scale distributed systems, with an emphasis on performance tuning and optimization.
  • Excellent problem-solving skills, with a focus on delivering high-quality, reliable, and scalable infrastructure solutions.
  • Strong communication and teamwork skills, with the ability to work effectively in a fast-paced, collaborative environment.

  • Why Binance
    • Shape the future with the world’s leading blockchain ecosystem
    • Collaborate with world-class talent in a user-centric global organization with a flat structure
    • Tackle unique, fast-paced projects with autonomy in an innovative environment
    • Thrive in a results-driven workplace with opportunities for career growth and continuous learning
    • Competitive salary and company benefits
    • Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)

    Binance is committed to being an equal opportunity employer. We believe that having a diverse workforce is fundamental to our success.
    By submitting a job application, you confirm that you have read and agree to our Candidate Privacy Notice.