Sr. Staff Software Engineer, Systems Infrastructure (Observability)
Full-time Mid-Senior Level
Job Overview
As part of our world-class software engineering team, you will build the next-generation observability platforms for LinkedIn. This includes, but is not limited to: instrumenting applications and infrastructure to collect meaningful metrics, logs, and traces; collaborating with development teams to ensure consistent and comprehensive instrumentation across all systems; and developing and maintaining libraries, frameworks, and tools that simplify instrumentation. You will work and learn among the best, putting to use your passion for building and scaling observability platforms and for writing code that performs at extreme scale. Come join our Observability Engineering team and share your knowledge with a broader community while making a real impact within LinkedIn.
This role will be based in Bangalore, India.
At LinkedIn, our approach to flexible work is centered on trust and optimized for culture, connection, clarity, and the evolving needs of our business. The work location of this role is hybrid, meaning it will be performed both from home and from a LinkedIn office on select days, as determined by the business needs of the team.
The Observability Experience (OE) group is chartered to simplify and drive observability for LinkedIn. The Observability Platforms team (a sub-team of OE) is responsible for billions of operational metrics ingested every minute within LinkedIn. Observability Platforms maintains the platforms and products for metrics ingestion, storage, retrieval, visualization, alerting, incident creation and escalation, remediation, health-checking, and the experience platform. We are one of the very few companies globally with a custom-built platform to support billions of metrics ingested at millions of QPS.
Responsibilities
- Scale distributed applications, make architectural trade-offs applying relevant design patterns, write code, and deliver with speed and quality.
- Develop multi-tier, scalable, high-volume, reliable, user-centric applications that operate 24x7.
- Produce high-quality infrastructure that is thoroughly tested, code-reviewed, and resilient.
- Provide technical leadership, driving and applying engineering best practices to initiate, plan, and execute large-scale, cross-functional, company-wide critical programs.
- Identify, leverage, and successfully evangelize opportunities to improve the observability of our stack.
- Deliver impact by driving innovation while building and shipping software at scale.
- Implement and manage data aggregation and MELT (Metrics, Events, Logs, Traces) instrumentation techniques, ensuring seamless integration of logs, traces, and metrics from both applications and infrastructure components.
- Provide architectural guidance and mentorship to up-level the engineering organization.
- Actively improve the level of craftsmanship at LinkedIn by developing best practices and defining effective strategies.
- Design products, services, tools, and code that can be used by others, while accounting for the operational impact of all decisions.
- Identify problems and opportunities, and lead teams to architect, design, implement, and operationalize systems.
- Work closely with, and regularly influence, product and technology partners to help define the roadmap.
- Resolve conflicts between teams within the organization to reach alignment and build a cohesive culture.