Site Reliability Engineer | Schlumberger

Job Details

Site Reliability Engineer

Abingdon - United Kingdom

Job title:

Site Reliability Engineer

 

Location:

Abingdon, United Kingdom – Abingdon Technology Center https://www.youtube.com/watch?v=H13CBeudfXY

 

About Schlumberger:

We are Schlumberger, the leading provider of technology and services to the energy industry. Throughout much of the oil and gas lifecycle in over 120 countries; we design, develop, and deliver technology and services that transforms how work is done.

We define the boundaries of the industry by unleashing our talented people’s energy. We’re looking for innovators to join our diverse community of colleagues and develop new solutions and push the limits of what’s possible. If you share our passion for discovery and want to find out what you could really do, then here is the place to do it.

 

Job Description:

At Schlumberger Abingdon, UK, we are in the search of a talented and enthusiastic Site Reliability Engineer to work in a team responsible for the end-to-end delivery of a new solution to help Oil and Gas operators to manage and reduce their methane emissions.  You will work on a fast pace agile environment, frequently running experiments to validate assumptions, where your contributions and decisions will have a direct impact on outcomes. 

The successful candidate will be responsible for the reliability and uptime of the product and engaging in improving the whole lifecycle of services from inception and design, through deployment, operation and refinement.

The solution will touch on several fronts such as data ingestion, storage, processing, monitoring and reporting; with the ambition to also has a mobile app to support users on the road.  You may expect to face plenty of challenges and there will be many opportunities to learn, adapt, and grow.   

 

Essential Responsibilities and Duties:

It is expected that the successful candidate will be able to: 

  • Maintain and improve services once they are live by measuring and monitoring availability, latency and overall system health.
  • Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
  • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
  • Gauges the effectiveness and efficiency of existing systems and infrastructure; implements strategies for improving or further leveraging these systems.
  • Collaborates with network and security staff to ensure smooth, secure and reliable operation of application software and systems.
  • Develops, implements and documents best practice policies and procedures for new projects or initiatives.
  • Effectively uses the service management systems, ensuring that best practices and lessons learned are made available to wider technical community.
  • Engaged in incident response and blameless postmortems.
  • Maintains a broad knowledge of state-of-the-art computer technology, equipment, and systems; participates in professional development activities as appropriate.
  • Support for SRE tooling such as: Rundeck, Pagerduty, Stackdriver, PAM access (cyber Ark), Operational Readiness (Internal process), DR/Incident Drills, Incident reports, Cost Dashboards, Billing exports, certificates etc.
  • Standard incident response and postmortems.

 

Qualifications and competencies:

The candidate must hold a minimum Bachelor’s degree in Information Technology or Computer Science.

The successful candidate should be familiar with the following technology:

  • Kubernetes, Dockers, Istio
  • Azure, Azure DevOps
  • Graphana, Prometheus, NoSQL and SQL DBs

The following would be preferred:

  • Strong in Software Engineering.
  • Interest in designing, analyzing and troubleshooting large-scale distributed systems.
  • Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
  • Ability to debug and optimize code and automate routine tasks.

The selected candidate should possess:

  • Ability to work independently and excellent team player.
  • Excellent problem-solving skills.
  • Ability to foster and maintains excellent internal, client and third-party relationships.
  • A high degree of initiative.
  • Adaptability and willingness to learn new technologies; keeps abreast of key developments in relevant technologies.
  • Ability to work under pressure in a fast-paced environment.
  • Excellent oral, written communication, and interpersonal skills.
  • Effective listening techniques.
  • Ability to effectively analyze and solve problems with attention to the root cause.

 

Compensation & Benefits:

  • Competitive package (£50,000 - £90,000) including performance related bonus
  • Private healthcare for employee + family
  • Flexible working: 2 days in office are required, 3 days flexible
  • Subsidised dental care
  • Health & Wellbeing programs
  • Relocation: If an employee lives more than 40 miles away from the office then they are eligible for relocation. This comes with:

1.       Custom Relocation assistance

2.       Relocation Allowance – 2 months’ base salary

  • Employee Mental health support, health & wellness coaching
  • Employee discounted share purchasing scheme & pension contribution (up to 6%)
  • Generous income protection scheme, life insurance (4 times base salary, min 150K)
  • Other benefits through flexible benefits program (Cycle to Work, salary sacrifice, option to select additional insurances like travel insurance, health screening…)

 

Schlumberger is an equal employment opportunity employer. Qualified applicants are considered without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, or other characteristics protected by law.

 

Site Reliability Engineer
Log in to apply for this position today.
Apply Now

Share This