Site Reliability Engineer (SRE) / System Engineer
Synechron · Concord, CA
We are

At Synechron, we believe in the power of digital to transform businesses for the better. Our global consulting firm combines creativity and innovative technology to deliver industry-leading digital solutions. Synechron’s progressive technologies and optimization strategies span end-to-end Artificial Intelligence, Consulting, Digital, Cloud & DevOps, Data, and Software Engineering, servicing an array of noteworthy financial services and technology firms. Through research and development initiatives in our FinLabs we develop solutions for modernization, from Artificial Intelligence and Blockchain to Data Science models, Digital Underwriting, mobile-first applications and more. Over the last 20+ years, our company has been honored with multiple employer awards, recognizing our commitment to our talented teams. With top clients to boast about, Synechron has a global workforce of 16,850+, and has 60 offices in 20 countries within key global markets.

Our challenge

We are seeking a highly skilled Site Reliability Engineer (SRE) / System Engineer with hands-on experience in Skan.ai or process intelligence platforms. The ideal candidate will be responsible for ensuring system reliability, performance, observability, and automation while supporting AI-driven process analytics environments.

Additional Information

*The base salary for this position will vary based on geography and other factors.
In accordance with law, the base salary for this role if filled within Concord, CA is $105k - $115k/year & benefits (see below).

The Role

Responsibilities:
- Design, build, and maintain highly available and scalable infrastructure supporting Skan.ai deployments.
- Monitor system performance, availability, and reliability using observability tools (Prometheus, Grafana, ELK, etc.).
- Manage and optimize cloud environments (AWS/Azure/GCP) for Skan.ai workloads.
- Implement and maintain CI/CD pipelines to support continuous deployment and system updates.
- Automate operational tasks, deployments, and system provisioning using Infrastructure as Code (Terraform, Ansible, etc.).
- Ensure high availability, disaster recovery, and fault tolerance of applications and infrastructure.
- Troubleshoot complex production issues and perform root cause analysis (RCA).
- Work closely with AI/ML and Data Engineering teams to support Skan.ai integrations and workloads.
- Manage system security, access controls, and compliance standards.
- Improve system reliability through capacity planning, load testing, and performance tuning.

Requirements:
- Strong experience in Site Reliability Engineering / System Engineering / DevOps roles.
- Hands-on experience with Skan.ai or similar process intelligence / AI platforms.
- Proficiency in cloud platforms: AWS, Azure, or GCP.
- Strong scripting/programming skills: Python, Bash, or Shell scripting.
- Experience with containerization & orchestration: Docker, Kubernetes.
- Familiarity with monitoring & logging tools: Prometheus, Grafana, ELK Stack, Datadog.
- Experience with CI/CD tools: Jenkins, GitHub Actions, GitLab CI.
- Knowledge of Linux/Unix systems administration.
- Understanding of networking concepts, security practices, and system architecture.

Preferred, but not required:
- Experience working with AI/ML platforms or data-driven applications.
- Exposure to process mining and process intelligence tools.
- Knowledge of SRE best practices (SLIs, SLOs, SLAs).
- Experience in automation frameworks and incident management tools.
- Certification in cloud technologies (AWS/Azure/GCP) is a plus.

We offer:
- A highly competitive compensation and benefits package.
- A multinational organization with 60 offices in 20 countries and the possibility to work abroad.
- 10 days of paid annual leave (plus sick leave and national holidays).
- Maternity & paternity leave plans.
- A comprehensive insurance plan including medical, dental, vision, life insurance, and long-/short-term disability (plans vary by region).
- Retirement savings plans.
- A higher education certification policy.
- Commuter benefits (varies by region).
- Extensive training opportunities, focused on skills, substantive knowledge, and personal development.
- On-demand Udemy for Business for all Synechron employees with free access to more than 5000 curated courses.
- Coaching opportunities with experienced colleagues from our Financial Innovation Labs (FinLabs) and Centers of Excellence (CoE) groups.
- Cutting edge projects at the world’s leading tier-one banks, financial institutions and insurance firms.
- A flat and approachable organization.
- A truly diverse, fun-loving, and global work culture.