Back to Jobs
Databricks

Staff Software Engineer, Observability at Databricks

Databricks Mountain View, CA

Job Description

RDQ426R299At Databricks we are passionate about enabling data teams to solve the worlds toughest problems from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the worlds best data and AI infrastructure platform so our customers can use deep data insights to improve their business.Our engineering teams build technical products that fulfill real important needs in the world. We always push the boundaries of data and AI technology while simultaneously operating with the security and scale that is important to making customers successful on our platform.We develop and operate one of the largest-scale software platforms. The fleet consists of millions of virtual machines generating terabytes of logs and processing exabytes of data per day. At our scale we observe cloud hardware network and operating system faults and our software must gracefully shield our customers from any of the above.As a software engineer in the Observability team you will develop observability solutions that provide insights into the health and performance of our products and infrastructure.The impact youll have:You will build the next generation of observability platforms that support billions of active time series and process petabytes of logs daily.You will manage infrastructure across nearly a hundred cloud regions enabling all Databricks engineers and customers to monitor the reliability of our product.You will develop advanced workflows that accelerate incident diagnosis for Bricksters allowing engineers to quickly derive insights from logs and metrics. You will leverage powerful capabilities of Databricks own data intelligence platform to push the boundaries of troubleshooting practices in the industry.You will uplevel monitoring and reliability practices across Databricks engineering developing opinionated tools that set common standards for managing structured logs metrics alerts dashboards and oncall rotations.Mentor and uplevel engineers fostering a culture of technical excellence within the team and broader observability community.What we look for:BS (or higher) in Computer Science or a related field.7 years of production-level experience in one of: Go Python Java Scala Rust C or similar languages.Experience in software development in large-scale distributed systems.Experience driving large projects involving multiple teamsExperience with cloud technologies e.g. AWS Azure GCP Docker or Kubernetes.Familiarity with observability infrastructure monitoring patterns and reliability practices.Required Experience:Staff IC Key Skills Campaigns,JSP,Dhtml,Loans,Automobile Employment Type : Full-Time Experience: years Vacancy: 1

Resume Suggestions

Highlight relevant experience and skills that match the job requirements to demonstrate your qualifications.

Quantify your achievements with specific metrics and results whenever possible to show impact.

Emphasize your proficiency in relevant technologies and tools mentioned in the job description.

Showcase your communication and collaboration skills through examples of successful projects and teamwork.

Explore More Opportunities