Senior Site Reliability Engineer - Cloud (VP) (Hybrid)
Jersey City, NJ 
Share
Posted 15 days ago
Job Description

About the Team: ICG Production Management:

Our Production Management team provides critical business and technical support to the Institutional Client Group (ICG); working collaboratively to ensure that our platforms and services operate for our clients, whenever they need them. We act as highly skilled and valued partners to our businesses.

Working as part of our team, you will help deliver a world class client service and experience, by applying engineering, innovation, learning and risk management across our systems and user environments. We interact with a diverse range of people each day, collaborating to solve problems as well as to anticipate and remove them before they occur. At Citi we look to raise the bar of operational excellence by using Site Reliability Engineering (SRE) principles to implement continuous improvement across key areas like latency, availability, performance, and capacity. We welcome candidates with SRE mindsets and experience who are keen to promote the adoption of SRE culture at Citi.

The Role:

As a Site Reliability Engineer, you will be critical in ensuring our software products' reliability, scalability, and performance. You will be responsible for designing and implementing highly available and fault-tolerant systems while working closely with the development team to deliver high-quality products. In addition, you will work on complex and challenging problems, develop innovative solutions, and contribute to a dynamic and collaborative team environment. If you have a passion for solving complex technical issues and ensuring the highest levels of system performance, we want to hear from you.

Functional Key Responsibilities:

  • Collaborate with development and product teams to ensure that applications and systems are designed and implemented with reliability, scalability, and performance in mind.
  • Automate and streamline operational processes, from deployment to monitoring and alerting, to improve efficiency and reduce manual error.
  • Design, implement, and maintain complex infrastructure systems for high-availability production environments using Terraform and Cloud Formation tools.
  • Monitor systems and applications for performance, availability, and security, and respond to issues quickly and efficiently.
  • Continuously improve systems and applications' reliability, scalability, and performance through root cause analysis, code and architecture review, and proactive monitoring.
  • Participate and respond to critical incidents promptly and efficiently, performing troubleshooting and incident management as needed.
  • Develop and maintain disaster recovery and business continuity plans to ensure business continuity in case of service outages or disasters.
  • Provide technical guidance and mentorship to other engineers on reliability and scalability best practices, tools, and methodologies.
  • Appropriately assess risk when business decisions are made, demonstrating particular consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behavior, conduct and business practices, and escalating, managing and reporting control issues with transparency.

Technical Skills/Qualifications:

  • Extensive experience within the IT Industry with proven experience as a Site Reliability Engineer or AWS Cloud Specialist.
  • Working knowledge of cloud computing services, with experience in Amazon Web Services (AWS).
  • Proficiency in infrastructure tooling including Terraform.
  • An understanding of Data Warehousing tools including Snowflake/Databricks/Redshift.
  • Hands-on experience on ETL Tools including Spark, DBT, Matilion, Apache Nifi.
  • Excellent working knowledge on Dockers/Kubernetes.
  • Working knowledge on various AWS deployment strategies [ECS/EKS] and services including S3, SNS, Lambda, VPC, Route53, IAM.
  • Experience with continuous integration tools including Jenkins, Artifactory, Tekton (or) Teamcity.
  • Strong analytical and problem-solving skills.
  • Consistently demonstrates clear and concise written and verbal communication skills.

Nice to Have:

  • Basic hands-on experience with various software languages including Python, Ruby, Go, C++, .NET, and BASH.
  • Experience in implementing security and compliance policies in a production environment.
  • Data Engineering and Data Analytic skills will be considered highly advantageous
  • Ability to write automation scripts in languages including Python, Ruby, or Go.
  • Demonstratable strong leadership and program management skills.

Education:

  • Bachelor's/University Degree in Computer Science, Computer Engineering or equivalent experience will be considered.

Exceptional candidates who do not meet these criteria may be considered for the role provided they have the necessary skills and experience as outlined above .

#SREICGPM

#ICGPMNAM

-------------------------------------------------

Job Family Group:

Technology

-------------------------------------------------

Job Family:

Applications Support

------------------------------------------------------

Time Type:

Full time

------------------------------------------------------

Primary Location:

Jersey City New Jersey United States

------------------------------------------------------

Primary Location Salary Range:

$137,610.00 - $206,420.00

------------------------------------------------------

Citi is an equal opportunity and affirmative action employer.

Qualified applicants will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.

Citigroup Inc. and its subsidiaries ("Citi") invite all qualified interested applicants to apply for career opportunities. If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review .

View the "EEO is the Law" poster. View the EEO is the Law Supplement.

View the .

View the Pay Transparency Posting

 

Job Summary
Start Date
As soon as possible
Employment Term and Type
Regular, Full Time
Required Education
Bachelor's Degree
Required Experience
Open
Email this Job to Yourself or a Friend
Indicates required fields