- Expertini Resume Scoring: Our Semantic Matching Algorithm evaluates your CV/Résumé before you apply for this job role: Lead Site Reliability Engineer (SRE).
 
  
  
    
    
  
      Urgent! Lead Site Reliability Engineer (SRE) Position in Los Angeles - EPAM Systems
 
                        
                         At EPAM, we’re not just building software — we’re engineering excellence.
  
We’re looking for a  **Lead Site Reliability Engineer (SRE)**  with a passion for performance, precision, and proactive problem-solving to join a high-impact team supporting a leading sell-side trading environment.
  
This role is ideal for someone who thrives in fast-paced financial systems, has a passion for working with data and monitoring tools, and wants to shape the reliability and efficiency of next-generation trading platforms.
  
The Site Reliability Engineer will focus on ensuring stable connectivity to external partners within a SaaS environment.
The ideal candidate will have expertise in financial systems, especially within trading ecosystems, and the ability to proactively drive performance enhancements and improve data usage and analysis.
By identifying areas of opportunity, they will help deliver improved service and systems for end users.
  
Additionally, the candidate will help proactively identify system issues, implement changes and resolutions, and ensure the stability of business-critical applications.
They will collaborate to build actionable plans, execute strategies, and lead initiatives to enhance system reliability.
  
**Responsibilities**
  
+ Provide a strategic vision for trading portfolio performance, covering network connectivity, traffic throughput, and applications
+ Define, configure, and set up alerting and monitoring frameworks for critical applications
+ Monitor application and platform performance using APM and monitoring tools to diagnose and resolve performance issues
+ Collaborate with Azure Cloud environments and contribute to a 24x7x365 support team to diagnose and address system challenges
+ Assess environmental and incident priorities, investigate issues swiftly, and execute efficient resolutions
+ Troubleshoot mission-critical systems and implement preventative problem management solutions
+ Lead on promoting observability, scalability, and resiliency best practices across development and operations teams
+ Analyze, design, and implement solutions to meet application performance and reliability goals
+ Collaborate with cross-functional teams to ensure smooth and unified troubleshooting and resolution processes across departments
+ Craft and maintain SLA/SLO dashboards to monitor system health and performance
+ Define and maintain SLIs, SLOs, and error budgets for applications and infrastructure to drive service improvement
+ Automate operational processes to enhance service offerings and system reliability
  
**Requirements**
  
+ 5+ years of experience in site reliability engineering, production support, or related roles in fast-paced environments
+ Showcase of leadership or mentoring experience (minimum of 1 year) in guiding cross-functional teams on system reliability
+ Knowledge of monitoring and observability tools such as AppDynamics, New Relic, Prometheus, or Grafana
+ Background in Azure Cloud services, CI/CD pipelines, and container orchestration (Kubernetes or Docker)
+ Proficiency in scripting with Python, Bash, or PowerShell for automation and efficiency gains
+ Understanding of network protocols (TCP/IP, DNS, HTTP) and troubleshooting tools such as Wireshark or tcpdump
+ Capability to analyze complex system issues and performance bottlenecks using APM and log analysis
+ Familiarity with implementing SLA/SLO metrics and monitoring for production systems
+ Combined skills in high-availability systems and database performance optimization
  
**Nice to have**
  
+ Expertise in SaaS solutions and APIs with a focus on handling external trading partners
+ Knowledge of disaster recovery strategies and business continuity planning
+ Background in trading platforms or buy-side/sell-side financial environments
  
EPAM is a leading global provider of digital platform engineering and development services.
We are committed to having a positive impact on our clients, our employees, and our communities.
We embrace a dynamic and inclusive culture.
Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow.
No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.Engineer the Future with a Career at EPAM (https://www.youtube.com/embed/NU_mnNITn2o?si=IiCxyQ4sr1YJWxDG)
  
**This Remote Position Cannot be Performed in New York City.**
  
Applications will be accepted on a rolling basis.
  
In accordance with the LA County Fair Chance Ordinance, you may find a copy of the Notice containing a summary of the Ordinance’s key provisions here:  Concept FCO Posting 8 27 24 (lacounty.gov)
  
H1B visa sponsorship is not available for this position.
  
It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment.
An employer who violates this law shall be subject to criminal penalties and civil liability.
EPAM Systems, Inc.
is an equal opportunity employer.
 We recognize the value of diversity and inclusion in creating success for our customers, business partners, shareholders, employees and communities.
We are committed to recruiting, hiring, developing and promoting employees without discrimination.
As a global employer, this commitment includes complying with all laws in the countries in which we operate.
Nevertheless, we believe equal employment practices should not be limited to what the law requires.
Equal opportunity and inclusion are essential to motivate, empower and recognize the best in everyone.
At EPAM, employment actions are based on individual qualifications, without regard to race, color, religion, creed, gender, pregnancy status, sexual orientation, gender identity, gender expression, marital or familial status, national origin, ancestry, genetics, age, disability status, veteran status, citizenship status when otherwise legally able to work, or any other characteristic protected by law.
 
                      
✨ Smart • Intelligent • Private • Secure
Practice for Any Interview Q&A (AI Enabled)
Predict interview Q&A (AI Supported)
Mock interview trainer (AI Supported)
Ace behavioral interviews (AI Powered)
Record interview questions (Confidential)
Master your interviews
Track your answers (Confidential)
Schedule your applications (Confidential)
Create perfect cover letters (AI Supported)
Analyze your resume (NLP Supported)
ATS compatibility check (AI Supported)
Optimize your applications (AI Supported)
O*NET Supported
O*NET Supported
O*NET Supported
O*NET Supported
O*NET Supported
European Union Recommended
Institution Recommended
Institution Recommended
Researcher Recommended
IT Savvy Recommended
Trades Recommended
O*NET Supported
Artist Recommended
Researchers Recommended
Create your account
Access your account
Create your professional profile
Preview your profile
Your saved opportunities
Reviews you've given
Companies you follow
Discover employers
O*NET Supported
Common questions answered
Help for job seekers
How matching works
Customized job suggestions
Fast application process
Manage alert settings
Understanding alerts
How we match resumes
Professional branding guide
Increase your visibility
Get verified status
Learn about our AI
How ATS ranks you
AI-powered matching
Join thousands of professionals who've advanced their careers with our platform
Unlock Your Lead Site Potential: Insight & Career Growth Guide
Real-time Lead Site Jobs Trends in Los Angeles, United States (Graphical Representation)
Explore profound insights with Expertini's real-time, in-depth analysis, showcased through the graph below. This graph displays the job market trends for Lead Site in Los Angeles, United States using a bar chart to represent the number of jobs available and a trend line to illustrate the trend over time. Specifically, the graph shows 90059 jobs in United States and 905 jobs in Los Angeles. This comprehensive analysis highlights market share and opportunities for professionals in Lead Site roles. These dynamic trends provide a better understanding of the job market landscape in these regions.
Great news! EPAM Systems is currently hiring and seeking a Lead Site Reliability Engineer (SRE) to join their team. Feel free to download the job details.
Wait no longer! Are you also interested in exploring similar jobs? Search now: Lead Site Reliability Engineer (SRE) Jobs Los Angeles.
An organization's rules and standards set how people should be treated in the office and how different situations should be handled. The work culture at EPAM Systems adheres to the cultural norms as outlined by Expertini.
The fundamental ethical values are:The average salary range for a Lead Site Reliability Engineer (SRE) Jobs United States varies, but the pay scale is rated "Standard" in Los Angeles. Salary levels may vary depending on your industry, experience, and skills. It's essential to research and negotiate effectively. We advise reading the full job specification before proceeding with the application to understand the salary package.
Key qualifications for Lead Site Reliability Engineer (SRE) typically include Other General and a list of qualifications and expertise as mentioned in the job specification. Be sure to check the specific job listing for detailed requirements and qualifications.
To improve your chances of getting hired for Lead Site Reliability Engineer (SRE), consider enhancing your skills. Check your CV/Résumé Score with our free Resume Scoring Tool. We have an in-built Resume Scoring tool that gives you the matching score for each job based on your CV/Résumé once it is uploaded. This can help you align your CV/Résumé according to the job requirements and enhance your skills if needed.
            Here are some tips to help you prepare for and ace your job interview:
Before the Interview:To prepare for your Lead Site Reliability Engineer (SRE) interview at EPAM Systems, research the company, understand the job requirements, and practice common interview questions.
Highlight your leadership skills, achievements, and strategic thinking abilities. Be prepared to discuss your experience with HR, including your approach to meeting targets as a team player. Additionally, review the EPAM Systems's products or services and be prepared to discuss how you can contribute to their success.
By following these tips, you can increase your chances of making a positive impression and landing the job!
Setting up job alerts for Lead Site Reliability Engineer (SRE) is easy with United States Jobs Expertini. Simply visit our job alerts page here, enter your preferred job title and location, and choose how often you want to receive notifications. You'll get the latest job openings sent directly to your email for FREE!