Salesforce India

  • #14 in Technology
  • 1,000 - 50,000 employees

Site Reliability Engineer - All Levels

India

Opportunity Expired

We believe everyone can be a Trailblazer. Join Salesforce and discover a future of new opportunities.

Opportunity details

Opportunity Type
Graduate Job
Start Date
Ongoing

Application dates

Applications Open
30 Sep 2021
Applications Close
31 Oct 2021

Minimum requirements

Accepting International Applications
No
Qualifications Accepted
Artificial Intelligence
Bioinformatics
Computer Graphics & Animation
Computer Science (all other)
Computer Systems and Networks
Cyber Security
Data Science
Design & User Experience
Programming & Software Engineering
Video Game Development

Hiring criteria

Bachelor of Information Technology

Job Description

When not fighting fires, the team is responsible for fire prevention through monitoring, automation, self-healing and resiliency initiatives, destructive testing, and game day exercises. The incumbent in this role would demonstrate a strong focus on tactical operations, as well as large-scale production engineering and orchestration.

Responsibilities

  • Keep the customer-facing services available at top performance by maintaining the constant health of the supporting systems.

  • Incident Management - act in key response roles during major incidents (e.g. Sev0, Sev1), and participate in the technical review of each incident for problem management

  • Problem Management - participate in Root Cause Analyses (RCAs) and hand them off to the Global Solutions team

  • Ensuring that work carried out by the Site Reliability team is executed in such a way as to comply with the company’s internal compliance policy and directives

  • Being available to discuss and resolve technical issues and escalations with other technical staff as required

  • Work with and lead other members of the team in staying on top of key industry innovation and technology, and assist in team development and growth

  • Identifying work opportunities and preparing or assisting with the preparation of technical proposals as required

  • Ability to operate in a high-pressure environment, troubleshoot complex issues quickly, and successfully handle multiple priorities

  • Work to automate detection and resolution of recurring issues in the production environment

  • Gain a deep understanding of the application and its inner workings and be able to pinpoint code defects to speed up remediations

Basic Requirements

  • Systems engineering experience in enterprise-scale internet service engineering or a related role

  • Experience with monitoring implementations and administration

  • Strong communication skills (written and oral)

  • Past experience in Incident Management for customer-facing applications

  • Experience in working in a 24/7 team

Preferred Qualifications

 

  • Python/Bash/Go scripting experience

  • Prior automated deployment experience

  • Experience in maintaining monitoring and alert systems

  • Experience troubleshooting relational databases and distributed platforms

  • Experience in maintaining Java and Go applications

  • Experience in Docker orchestration and management

  • Experience with Kubernetes

  • Hands-on experience configuring and managing AWS (Amazon Web Services), using the CLI/SDKs

  • Experience managing systems monitoring and alerts

  • Experience with JVM optimization and Java server technologies like Tomcat or Jetty

Hiring criteria

You should have or be completing the following to apply for this opportunity.

Bachelor of Information Technology
Degree or Certificate
Minimum Level of Study
Bachelor or higher
From an Institution in
  • India
Study Field
Artificial Intelligence
Bioinformatics
Computer Graphics & Animation
Computer Science (all other)
Computer Systems and Networks
Cyber Security