Site Reliability Engineer

  • Pune, Maharashtra, IN

  • Software Engineering
  • Professional

Site Reliability Engineer

  • Pune, Maharashtra, IN

  • Software Engineering
  • Professional

Introduction
At IBM, work is more than a job – it’s a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you’ve never thought possible. Are you ready to lead in this new era of technology and solve some of the world’s most challenging problems? If so, lets talk.

Your Role and Responsibilities
As a Virtualization Platform Engineer, you will be part of the Cirrus Hybrid Cloud virtualization team responsible for ensuring the architectural integrity and successful delivery of a scalable virtualization platform for the IBM CIO Organization.
In this role you will focus on the management of virtualization platform for Cirrus Hybrid Cloud. This entails working on all aspects designing, engineering, implementing, and maintenance of various virtualization solutions.
You will help solve intriguing problems while partnering with other team members, customers, and vendors. For success in this role, you will have a strong Python or Ruby programming language background and a passion for learning and continuous improvement.
What you will do:

  • Design, Management, maintenance, and support of various virtualization solutions especially the RedHat OpenShift Virtualization (OSV) and VMware.
  • Create infrastructure using any from: Ansible, Terraform, Argo, OpenShift IPI, UPI, ZTP Zero Touch Provisioning
  • Operate in an agile manner and under strict change control
  • Maintain the environment according to the Policy Compliance Management requirements.
  • Troubleshoot and resolve Hypervisor/Operating System-based issues from Performance to Configuration
  • Backing up and protect virtual environments using platform-specific tools
  • Perform daily system checks, review, and respond to events reflected in various management tools, perform server patch management.
  • Conduct system audit reviews and perform maintenance functions as required to ensure system health.
  • Troubleshoot and resolve problems for all applications.
  • Support, implement and maintain new applications coming into the environment.
  • Present status information on issues and problems at the weekly team meetings.
  • Document software changes.
  • Document problem resolution steps.
  • Assure best-practices and standards are implemented and adhered to for software systems
  • Provide on-call support and implementation after-hours on a rotating basis
  • Think and act like a Site Reliability Engineer (SRE) as the environment relates to virtualization


Required Technical and Professional Expertise

  • A minimum of 2-3 years’ experience in Virtualization & Automation as a Site Reliability Engineer.
  • OpenShift / Kubernetes administration such as building, patching, debugging, and maintaining clusters
  • Experience designing, building, and supporting large-scale production systems
  • Experience in Python, Golang and Ruby programming languages
  • Strong knowledge of server virtualization technology in VMware and cloud infrastructure
  • Strong Linux (RHEL) and system administration and networking skills


Preferred Technical and Professional Expertise

  • Able to describe and traverse Kubernetes or OpenShift internals, deep familiarity with how a cluster works and the underlying architecture and its components
  • Administration or usage of cloud native delivery solutions such as ArgoCD or Flux
  • Experience automating infrastructure, configuration management, testing, and deployments using tools like Ansible, Chef and can explain the Infrastructure as Code paradigm
  • Familiarity with managing and securing a distributed Windows Server and/or Linux environment

Vous voulez savoir ce que c’est que d’être un IBMer ?


About IBM

IBM’s greatest invention is the IBMer. We believe that through the application of intelligence, reason and science, we can improve business, society and the human condition, bringing the power of an open hybrid cloud and AI strategy to life for our clients and partners around the world.

Restlessly reinventing since 1911, we are not only one of the largest corporate organizations in the world, we’re also one of the biggest technology and consulting employers, with many of the Fortune 50 companies relying on the IBM Cloud to run their business.

At IBM, we pride ourselves on being an early adopter of artificial intelligence, quantum computing and blockchain. Now it’s time for you to join us on our journey to being a responsible technology innovator and a force for good in the world.

Détails clés de l’offre

Vous ne trouvez pas votre bonheur en ce moment ?

Ne vous inquiétez pas. Rejoignez notre réseau de talents et recevez des informations sur les dernières opportunités.