Job Details
Location:
1950 Summit Park Dr, Orlando, FL 32810, USA
AFI Park 2, Bulevardul General Vasile Milea 4, București 061344, Romania
Posted:
Apr 10, 2024
Job Description
EA is looking for an experienced Infrastructure Site Reliability Engineer (SRE) with a strong understanding of On-premises Infrastructure, cloud computing, Database, and Virtual technologies to join us.
As Site Reliability Engineer (SRE) you will support us and our complex systems running VMware vSphere v7/8x and infrastructure/applications hosted on the EA Private and Public clouds, like AWS/Azure/GCP. You will also be working in a team environment, following established business processes and procedures, preparing knowledge documentation SOPs, and performing changes and preventative maintenance to ensure the smooth day-to-day operation of EA's computing environment.
This role reports into the Senior Manager.
Job Requirements/Role
SRE's responsibilities include:
- You will be maintaining reliability and performance, fixing issues and errors, automating tasks, responding to incidents, and managing on-call responsibilities.
- You will build infrastructure-based processes or methodologies to be used by system, Database, cloud, and Linux engineers in cross-functional environments.
- You will collaborate with project and product managers to ensure that the stated vision for a service is compatible with non-functional system requirements like performance, latency, availability, and security.
- You will work with engineering teams at the staging phase of the build process to ensure optimal delivery efficiency.
- You will act as SME for the operation of resilient distributed systems, Databases that run across multiple data center and cloud providers
- Directly influence our journey towards zero-touch, highly scalable, reliable infrastructure.
- Resolve complex technical issues and drive innovations that improve system, storage, Database, availability, and performance.
- Deliver a solid playbook of operative guidelines, eliminating hands-on work and redundancy.
Skills/Experience
- 7+ years of progressive, technical experience—expertise in the Configuration Management System, VMware, RedHat Linux, MSSQL, MYSQL, Postgres, RDS, and Kubernetes.
- Bachelor's degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience.
- Understanding of development and operations
- Familiarity with production monitoring systems
- Ability to collaborate across multi-functional teams
- Experience with scripting and automation using languages like Python, BASH, PowerShell, HTML + CSS, Django, CI/CD, and Teraform.
- Strong background in continuous integration and deployment (CI/CD) practices.
- Ability to diagnose and resolve technical issues swiftly and effectively.
- 3rd-party API integration experience.
- Experience deploying and managing resources in a production setting in AWS, Azure, or Google Cloud platforms.
- Experience with distributed storage technologies such as NFS S3, as well as dynamic resource management frameworks (Apache, Kubernetes, Yarn)
- Expert knowledge of Virtualization Technologies and associated services (specifically VMware vCenter, vRealize Automation, and vROPS)
- Experience with orchestration tools that support the automated delivery of enterprise platforms
- Experience with configuration management frameworks/tools (such as Ansible, Chef, Puppet, etc.)
- Knowledge of the ITIL process