Cloud Infrastructure Site Reliability Engineer (SRE) (Berkeley Heights) Job at Intelliswift - An LTTS Company, Berkeley Heights, NJ

  • Intelliswift - An LTTS Company
  • Berkeley Heights, NJ

Job Description

Job Posting Title: Cloud Infrastructure Site Reliability Engineer (SRE)

Location: Alpharetta, GA or Berkeley Heights, NJ

Position Summary:

As a Cloud Infrastructure Site Reliability Engineer (SRE) with expertise in multiple public cloud service provider platforms, you will be responsible for operating infrastructure solutions, following the principles and practices pioneered by Google's SRE model. Your work will ensure our cloud services meet uptime, reliability, and performance targets, and you will drive automation and continuous improvement across our production environments. This role will involve collaborating with cross-functional teams to enhance our cloud reliability posture and streamline processes through automation.

Key Responsibilities:

Design, build, and maintain highly available, scalable, and secure cloud infrastructure on platforms such as AWS, GCP, or Azure.

Develop and implement automation for provisioning, monitoring, scaling, and incident response using Infrastructure-as-Code tools (e.g., Terraform, CloudFormation, Ansible).

Monitor system reliability, capacity, and performance; proactively detect and address issues before they impact users.

Respond to production incidents, participate in on-call rotations, and lead post-incident reviews to drive root cause analysis and reliability improvements.

Collaborate with software engineering and security teams to ensure new services and features are production-ready and meet reliability standards.

Build and maintain tools for deployment, monitoring, and operations; automate manual processes to reduce toil.

Document operational processes and system architectures to ensure knowledge sharing and repeatability.

Continuously evaluate and implement new technologies to improve system reliability, security, and efficiency.

Qualifications:

Bachelor's degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience.

3+ years of experience in software development with proficiency in at least one programming language (e.g., Python, Go, Java, C++).

Experience administering cloud platforms (AWS, GCP, Azure), including networking, security, containerization, storage, data management, and serverless technologies.

Solid understanding of Linux systems, networking fundamentals, virtualized and distributed systems, file systems, and system processes and configurations.

Deep understanding of observability (monitoring, alerting, and logging) tools in cloud environments. Ability to set up and maintain monitoring dashboards, alerts, and logs.

Familiarity with Continuous Integration/Continuous Deployment (CI/CD) tools for automated testing, deployments, provisioning, and observability.

Ability to manage and respond to incidents, perform root cause analysis, and conduct post-mortem reviews.

Understanding of setting, monitoring, and maintaining Service-Level Objectives (SLOs) and Service-Level Agreements (SLAs) for system reliability.

Additional Qualifications a Plus:

Experience working with enterprise-scale financial services or other regulated industries.

5+ years of experience in SRE, DevOps, infrastructure, or cloud engineering roles, preferably supporting large-scale, distributed systems.

Excellent problem-solving, troubleshooting, and communication skills.

Experience leading technical projects or mentoring junior engineers.

Certifications: Certified Engineer, DevOps, SRE, CSREF

Job Tags

Part time
