Netflix is one of the world's leading entertainment services, with over 300 million paid memberships in over 190 countries enjoying TV series, films and games across a wide variety of genres and languages. Members can play, pause and resume watching as much as they want, anytime, anywhere, and can change their plans at any time.
This role is part of the Exploration and Troubleshooting team, a key part of our Observability engineering group, the “eyes and ears” of Netflix engineering.
Observability engineering provides the platform and suite of products that allow Netflix engineers to understand how their services behave in real time, detect anomalies in system health, and troubleshoot and remediate problems. Our platform processes billions of data points in real time every minute. The success of our platform and products is crucial to Netflix's success and our ability to operate the Netflix cloud.
What you will do:
Design and implement the distributed backbone for Netflix Observability's agentic AI-driven analysis, inference, and orchestration systems.
Develop robust ingestion and correlation layers that unify signals from logs, metrics, traces, and alerts across cloud and on-prem environments.
Optimize and extend workflows for real-time, actionable recommendations and RCA automation.
Collaborate cross-functionally with Observability, SRE, and Platform teams to scale AutoSRE as a self-serve, extensible AI agent for Netflix engineering.
You will thrive in the role if:
Well-informed opinions: You have well-informed opinions on subjects and do not shy away from making decisions.
Provide and take feedback: You are proactive about soliciting and providing feedback (technical, behavioral, instilling team norms).
Selfless: You seek what is best for Netflix and make time to help colleagues across the team succeed.
Comfortable with ambiguity: You are curious and enjoy working on ambiguous problems where the solutions still need to be defined. You excel at cross-functional ownership and driving alignment to reach a decision, even when it is outside of your wheelhouse.
A deep sense of ownership: You take deep ownership of your projects and feel pride in delivering great work with attention to detail.
Share Knowledge: You like to share your learnings and mentor folks around you in your areas of expertise.
Desired Background:
Industry Experience: You have 8+ years of software engineering experience.
GenAI stack: You have a strong interest in and experience with the latest GenAI stack (LLMs, RAG, Agents). You are familiar with workflow engines such as Temporal, AWS Lambda, AWS AgentCore, and LangGraph, and with AI observability systems such as Braintrust.
Distributed systems: You have experience building and operating scalable, observable, fault-tolerant distributed systems. You have experience with AWS services.
Tech stack: You are proficient in Java, gRPC, and Python. Familiarity with Scala is a plus.
Full lifecycle engineer: You are knowledgeable about, and willing to own, all areas of the software lifecycle: design, development, test, deploy, operate, and support.
Nice To Have:
Observability Experience: You have extensive knowledge of, or have built, observability products for logs, metrics, and traces.
Our compensation structure consists solely of an annual salary; we do not have bonuses. You choose each year how much of your compensation you want in salary versus stock options. To determine your personal top-of-market compensation within the market range, we rely on market indicators and consider your specific job family, background, skills, and experience. The range for this role is $100,000 - $720,000.
Inclusion is a Netflix value and we strive to host a meaningful interview experience for all candidates. If you want an accommodation/adjustment for a disability or any other reason during the hiring process, please send a request to your recruiting partner.
We are an equal-opportunity employer and celebrate diversity, recognizing that diversity builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate on the basis of race, religion, color, ancestry, national origin, caste, sex, sexual orientation, gender, gender identity or expression, age, disability, medical condition, pregnancy, genetic makeup, marital status, or military service.