- Abbott (Pleasanton, CA)
- …generic medicines. Our 114,000 colleagues serve people in more than 160 countries. **Staff Site Reliability Engineer ** **Working at Abbott** At Abbott, you ... work for diversity, working mothers, female executives, and scientists. **The Opportunity** **Staff Site Reliability Engineer ** As senior member of Site … more
- Rubrik (Palo Alto, CA)
- …infrastructure services run smoothly and have the capacity for future growth. As a Senior Site Reliability Engineer , you will be responsible for: + Ensure we ... technologies + Minimum 3-5 years of experience as a Development, DevOps or Site Reliability Engineer Willing to provide 24/7 coverage + Strong Documentation… more
- Rubrik (Palo Alto, CA)
- …to make an impact on product stability and success. **What you'll do:** As a Senior Site Reliability Engineer , you will be responsible for: + Manage and run ... technologies + Minimum 5 years of experience as a Development, DevOps or Site Reliability Engineer + Willing to provide 24/7 coverage + Strong Documentation… more
- NVIDIA (Santa Clara, CA)
- …on the world. NVIDIA is looking to hire a deeply technical, creative, and Staff Site Reliability Engineer to build, support and maintain the next generation ... architects, and business teams to ensure optimal operation and reliability of applications. + Define and lead technical roadmap...are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want… more
- EPAM Systems (San Jose, CA)
- EPAM is hiring a **Remote Lead Site Reliability Engineer ** . If you are looking for a high-impact, exciting role with a company that leads the globe in the ... the value of SRE, mentor and train other engineers around proactive reliability decision making and planning + Review code instrumentation with development teams… more
- NVIDIA (Santa Clara, CA)
- …accelerate the next wave of artificial intelligence. Join our team at NVIDIA as a Senior Site reliability engineer focused on HPC storage and play a crucial ... role in designing, implementing, and optimizing on-prem High-Performance Computing (HPC) storage solutions while harnessing the power of cloud computing. You will be responsible for crafting and deploying distributed storage solutions, build automation tools,… more
- Tarana Wireless (Milpitas, CA)
- As a Senior Site Reliability Engineer , you will help us manage software that runs on the cloud and remotely manages millions of radio devices. You will work ... on a team and be a main point of contact during off shore hours and responsible for all aspects of cloud operations, such as: + Infrastructure as Code + Manage environments in AWS + Monitoring and alerting + Automation for scaling, failure recovery, disaster… more
- Walmart (Sunnyvale, CA)
- Position Summary What you'll do **Principal Site Reliability Engineer :** This position is responsible for the operation of a department. An individual in ... align with site environment changes. Integrates the business goals of site reliability engineering and site safety engineering. Trains team members on… more
- General Motors (Palo Alto, CA)
- …this exciting journey toward a better future **.** **Responsibilities:** + Lead Site Reliability engineering effort to improve anomaly detection, platform ... + Participate in on-call engineering duty to support production. + Instill Site Reliability best practice through automation, data insights, and observability… more
- Splunk (San Jose, CA)
- …journey! **Role:** _Splunk_ 's _Cloud_ Services group is looking for aS _ite Reliability_ Engineer to help lead, design and build the next generation of our _large ... scale cloud_ offering. You will be working on core services and applications that form the primitives for our current and future cloud service offerings. _Site Reliability_ Engineers in this role will be engaging with multiple service owners across the… more
- NVIDIA (Santa Clara, CA)
- Site Reliability Engineering (SRE) at NVIDIA is an engineering discipline to design, build and maintain large scale production systems with high efficiency and ... internal and external facing GPU cloud services run maximum reliability and uptime as promised to the users and...be doing: + Design, implement and support operational and reliability aspects of large scale Kubernetes clusters with focus… more
- Palo Alto Networks (Santa Clara, CA)
- …with the DevOps and the RND team to develop new features and maintain high reliability for our SAAS Products (XDR, XSIAM, XSOAR and XSPANSE) + Work with the US ... product features **Your Experience** + Expert level experience as a DevOps/SRE engineer with a passion for technology and strong motivation and responsibility for… more
- Netflix (Los Gatos, CA)
- …the world is a hard challenge, demanding exceptional levels of stability and reliability from dozens of services and systems between camera and device screens. About ... a Live Streaming Pipeline SRE, you will be responsible for the reliability of our live streaming pipeline (transmission, encoding, packaging, origin). Instrumenting… more
- IBM (San Jose, CA)
- …managers, and other stakeholders to understand requirements and ensure the reliability of the platform. Continuous Improvement: Participate in post-incident reviews, ... including knowledge of best practices for scalability, performance, and reliability . + Experience with Monitoring and Observability: Experience with advanced… more
- Cisco (San Jose, CA)
- …with either Micro-Services Architecture development, DevOps Engineering, Cloud Operations, Site Reliability Engineering, Services Engineering or Information ... Technology Preferred Qualifications * 3 + years of public cloud experience with Linux, Kubernetes, and AWS * Experience with GitHub, Docker, k9s, Argo * Development experience, with Python or other language * Team Player - able to collaborate with various… more
- Zoom (San Jose, CA)
- …clusters within different infrastructures. You will also design and implement reliability best practices to accomplish a highly available service (99.99%). ... Additionally, you will identify and fix problems in Kubernetes operators, submitting code fixes to OSS if needed. Contributing to capacity planning, and anticipating performance bottlenecks are critical to success. You will also troubleshoot production issues… more
- Zoom (San Jose, CA)
- …& experience. We also have a location based compensation structure; there may be a different range for candidates in this and other locations. Ways of WorkingOur ... structured hybrid approach is centered around our offices and remote work environments. The work style of each role, Hybrid, Remote, or In-Person is indicated in the job description/posting. BenefitsAs part of our award-winning workplace culture and commitment… more
- EPAM Systems (San Jose, CA)
- …of all other expected benefits and compensation for the position EPAM Systems, Inc. is an equal opportunity employer. We recognize the value of diversity and ... We are looking for a candidate to join a multi-functional SRE team with the focus on Google Cloud Platform. You should have cloud engineering experience in such areas acting as the SME on operation automation and monitoring, identifying TOIL within the team's… more
- Zoom (San Jose, CA)
- …& experience. We also have a location based compensation structure; there may be a different range for candidates in this and other locations. Ways of WorkingOur ... structured hybrid approach is centered around our offices and remote work environments. The work style of each role, Hybrid, Remote, or In-Person is indicated in the job description/posting. BenefitsAs part of our award-winning workplace culture and commitment… more
- IBM (San Jose, CA)
- …and Responsibilities Location preference the Silicon Valley area We're looking for an experienced Site Reliability Engineer to join our team. At IBM, the ... Software Defined Networking (SDN) business which includes IBM Hybrid Cloud Mesh, NS1 and other offerings focuses on software based networking, an architecture approach that enables network to be intelligently and centrally controlled using software with main… more