Site Reliability Engineer (Irving) Job at Optomi, Irving, TX

OTZDbk9nNXN0OFZJb21IbFhQNmdOa29nZ0E9PQ==
  • Optomi
  • Irving, TX

Job Description

Optomi, in partnership with our client, are seeking an experienced SRE II to join their team for a 6 month contract to hire opportunity that is 2 days hybrid onsite in Irving, TX.

W2 only - no C2C/sponsorship at this time.

We are seeking a highly skilled Site Reliability Engineer II to join our engineering organization. This role focuses on building resilient, scalable, and automated systemsnot traditional production support. The ideal candidate has hands-on engineering experience across cloud infrastructure, observability, automation, and reliability-focused development.

You will work closely with development, cloud engineering, and platform teams to ensure high availability, optimal performance, and operational excellence of critical customer-facing applications.

Key Responsibilities

  • Contribute directly to the reliability, scalability, performance, and security of critical applications.
  • Build reusable services, automation, and frameworks that improve platform stability and developer velocity.

Cloud & Platform Engineering

Design and enhance cloud infrastructure using Azure services including:

  • Azure Service Bus
  • Event Hub
  • Azure SQL
  • AKS (Azure Kubernetes Service)
  • Function Apps
  • App Services
  • Implement and manage Infrastructure as Code (IaC) using Terraform.

Containerization & Orchestration

  • Build and deploy containerized applications using Docker (23+ years).
  • Support Kubernetes workloads via AKS, including scaling, upgrades, and cluster reliability improvements.

Development & DevOps

  • Collaborate with development teams using a working knowledge of .NET.
  • Improve CI/CD workflows using Azure DevOps (ADO).

Monitoring, Observability & Incident Response

  • Implement and optimize monitoring and alerting strategies.
  • Use Splunk Observability Cloud (preferred) or equivalent observability platforms to enhance visibility and reduce MTTR.
  • Drive proactive incident identification, root-cause analysis, and long-term fixes.

Performance, Reliability & Scalability Enhancements

  • Design and implement SLOs, SLIs, and error budgets.
  • Develop auto-scaling policies, failover strategies, and disaster recovery procedures.
  • Optimize application and database performance to ensure reliability across high-traffic, mission-critical systems.

Required Qualifications

  • 35+ years of hands-on SRE experience
  • Bachelors degree in Computer Science, Engineering, or a related technical field (or equivalent experience)
  • Masters degree preferred

Hands-on experience with:

  • Azure Cloud (AKS, Service Bus, Event Hub, SQL, Function Apps, App Services)
  • Terraform
  • Docker
  • Azure DevOps
  • Monitoring tools (Splunk Observability Cloud preferred)
  • .NET ecosystem (understanding of development fundamentals)

Preferred Skills

  • Experience designing resilient, distributed systems
  • Strong troubleshooting and analytical skills
  • Performance tuning across applications, databases, and cloud services
  • Experience improving uptime, latency, throughput, or cost efficiency of production applications
  • Familiarity with SRE principles and modern operational practices

Job Tags

Contract work, Part time,

Similar Jobs

USA Truck

CDL A Owner Operator Truck Driver Job at USA Truck

 ...Job Description Job Description CDL-A OWNER OPERATOR TRUCK DRIVER JOBS Experienced CDL-A Owner Operator truck drivers can count on USA Truck for outstanding support while running your business your way, letting you decide how and when to work! NEW! Independent... 

CommonSpirit Health

Urgent Care Physician Job at CommonSpirit Health

 ...Urgent Care Physician at CommonSpirit Health summary: The Urgent Care Physician provides timely medical diagnosis and treatment in a busy urgent care clinic affiliated with a large health system. The role involves patient care, performing procedures, collaborating... 

The Pivot Group Network

Engineering Manager - Byron Center Area Job at The Pivot Group Network

 ...Description Job Description Engineering Manager | Byron Center, MI Salary Range: $117...  ...optimize manufacturing processes, reduce waste, and enhance product quality. Ensure...  ..., and continuous improvement. Solid project management skills, including scope... 

BJ's Wholesale Club

Bakery Associate Job at BJ's Wholesale Club

 ...Wholesale Club, a leader in the membership-only warehouse retail segment in the eastern United States, is looking for a dedicated Bakery Associate to join our vibrant food and beverage team. Our Bakery Associates play an essential role in maintaining our reputation for... 

Amtrak

Train Conductor Trainee Launch Your Rail Career Job at Amtrak

 ...A major railway company in Washington, DC, is seeking a PASSENGER CONDUCTOR TRAINEE. Responsibilities include assisting the Conductor in train movements and ensuring passenger safety. Candidates need some work experience, the ability to lift 50 lbs, and a valid driver...