Ensure operational stability and high availability
Lead and coordinate the external operations team to meet service level agreements and operational requirements
Manage and improve CI/CD pipelines for seamless deployment
Oversee AWS cloud infrastructure including ECS instances, containerization solutions, and related cloud services
Implement and maintain monitoring systems (Prometheus, Grafana, ELK) for proactive issue identification and resolution
Manage Snowflake database operations including performance monitoring, backup strategies, and access controls
Analyze and resolve data flow issues to maintain seamless integration between applications and data integrity
Implement and maintain security measures for authentication, authorization, and data protection across the application stack
Coordinate between internal teams and external partners as the primary point of contact for operational matters
Identify infrastructure issues and work with development teams to implement timely resolutions.
Monitor resource utilization and plan capacity needs to support application growth and performance
Develop and maintain disaster recovery plans and procedures to ensure business continuity
Anticipate challenges and implement preventive measures to enhance system reliability before problems arise
Maintain comprehensive documentation of system configurations, processes, and troubleshooting procedures
Monitor cloud resource costs and provide recommendations for optimization without compromising performance
Work with development teams to identify and implement performance improvements across the application stack
What should you bring along
Proactive mindset with a hands-on attitude towards problem-solving
Bachelor's degree in Computer Science, Information Technology, or a related field
Minimum of 5-7 years professional work experience in IT operations management
Experience managing technical teams, preferably in a vendor management capacity
Proven experience in IT operations, particularly in managing Java-based backend services and modern frontend applications within AWS cloud environments
Strong understanding of CI/CD practices and tools (GitHub Actions, Jenkins, Terraform, etc.)
Knowledge of database operations, particularly with Snowflake or similar cloud data warehouses
Experience with monitoring tools (Prometheus, Grafana, ELK stack) and APM solutions
Excellent communication skills, with the ability to coordinate effectively with diverse teams and stakeholders
Familiarity with infrastructure and security requirements, acting as the first contact for external requests
Preferred Skills
AWS certification (Solutions Architect, DevOps Engineer) or equivalent cloud experience
Knowledge of scripting languages (e.g., Python, Bash) for automation tasks
Experience with Infrastructure as Code (Terraform, CloudFormation)
Familiarity with security best practices in cloud environments and OWASP guidelines
Experience with log aggregation and analysis tools
Understanding of operations in the area of financial services and the need for compliance with regulations
Knowledge of data protection and privacy requirements in financial applications
Experience in agile working methodologies and DevOps culture
Incident management and problem-solving experience in enterprise environments
Knowledge of cost optimization strategies for cloud resources