Remote Site Reliability Engineer (Kubernetes Focus)

Job Overview

Location
Hyderabad, Telangana, India
Job Type
Full-time

Additional Details

Job ID
16999

Job Description

Job Summary

Houston Skilled Consultancy is seeking a Remote Site Reliability Engineer (SRE) with a strong focus on Kubernetes to ensure the reliability, scalability, and performance of our clients' cloud-based infrastructure. The ideal candidate will bring a deep understanding of container orchestration, cloud-native technologies, and systems automation to keep systems running smoothly and efficiently at scale. This is a high-impact role for professionals passionate about DevOps practices, modern cloud architecture, and continuous improvement.


Key Responsibilities

  • Design, implement, and maintain highly available and scalable Kubernetes clusters across multi-cloud environments.

  • Monitor infrastructure and system performance using observability tools such as Prometheus, Grafana, and the ELK stack.

  • Troubleshoot and resolve production issues, conducting root cause analysis and implementing preventative measures.

  • Develop and maintain infrastructure as code (IaC) using Terraform or Helm.

  • Automate repetitive tasks and streamline CI/CD pipelines for infrastructure and applications.

  • Collaborate closely with development, QA, and security teams to enhance deployment workflows.

  • Ensure system security through proper configuration, access controls, and policy implementation.

  • Define and manage SLOs/SLAs to meet service reliability standards.

  • Participate in on-call rotations to handle incident response and service recovery.


Required Skills and Qualifications

  • Proven experience as an SRE, DevOps Engineer, or Systems Engineer with a focus on Kubernetes.

  • Proficiency in Kubernetes (EKS, AKS, GKE) setup, management, and troubleshooting.

  • Strong experience with Linux systems administration and shell scripting.

  • Hands-on expertise with CI/CD tools such as Jenkins, GitHub Actions, or GitLab CI.

  • Strong understanding of containerization technologies (Docker) and service meshes (Istio, Linkerd).

  • Experience with monitoring and alerting tools (Prometheus, Grafana, Alertmanager).

  • Skilled in using Infrastructure as Code (IaC) tools such as Terraform, Ansible, or Helm.

  • Familiarity with networking, load balancing, and security best practices in cloud environments.

  • Ability to code in Python, Go, or Bash for automation tasks.


Experience

  • Required: 3–5 years of relevant experience in SRE, DevOps, or related cloud infrastructure roles.

  • Preferred: Experience managing production-grade Kubernetes environments in multi-cloud setups.


Working Hours

  • Flexible remote working schedule with availability for critical incident response and scheduled on-call rotations.

  • Core collaboration hours: 10:00 AM – 6:00 PM IST with flexibility depending on project needs.


Knowledge, Skills, and Abilities

  • Deep understanding of distributed systems, cloud-native architectures, and microservices.

  • Strong analytical and troubleshooting abilities in complex, high-availability environments.

  • Ability to handle high-pressure incidents calmly and efficiently.

  • Strong communication and collaboration skills to work with cross-functional teams.

  • Demonstrated ability to document processes, runbooks, and incident post-mortems.


Benefits

  • 100% Remote Work Opportunity

  • Competitive Salary and Annual Performance Bonuses

  • Paid Time Off and National Holidays

  • Professional Development Allowance

  • Health & Wellness Reimbursements

  • On-Call Compensation

  • Opportunity to work with cutting-edge cloud-native technologies


Why Join Houston Skilled Consultancy?

At Houston Skilled Consultancy, we are not just building infrastructure—we are building careers. Our remote-first culture empowers top talent from across the globe to thrive in challenging and rewarding projects. You will join a team of forward-thinking engineers who value innovation, reliability, and collaboration. If you are looking to grow your expertise in SRE while shaping the backbone of scalable applications, this is the place for you.


How to Apply

To apply, please submit your updated resume and a brief cover letter highlighting your experience with Kubernetes and Site Reliability Engineering to us.
Subject Line: Application for Remote Site Reliability Engineer (Kubernetes Focus)

Only shortlisted candidates will be contacted for interviews.
