Fulcrum Digital Inc

Sr Chaos Engineer

Fulcrum Digital Inc  •  Pune, IN (Onsite)  •  5 months ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description


Who are we


Fulcrum
Digital is an agile and next-generation digital accelerating company providing
digital transformation and technology services right from ideation to
implementation. These services have applicability across a variety of
industries, including banking & financial services, insurance, retail,
higher education, food, healthcare, and manufacturing.


Key Responsibilities:

Chaos Engineering:

· Design and implement chaos engineering experiments to identify weaknesses in systems and applications.

· Develop and execute strategies to improve system resilience and reliability.

· Analyze experiment results, provide actionable insights, and drive remediation efforts.

· Collaborate with development, operations, and infrastructure teams to integrate chaos engineering practices.

Operational Acceptance:

· Develop and maintain comprehensive operational acceptance criteria for new and existing systems.

· Conduct thorough operational acceptance testing, ensuring systems meet all predefined criteria before go-live.

· Work closely with project managers, developers, and QA teams to align operational acceptance processes with project timelines and objectives.

· Document and communicate operational readiness findings, providing recommendations for improvement.

System Resilience and Reliability:

· Implement and manage strategies for continuous improvement of system resilience and reliability.

· Monitor and assess system performance, identifying potential risks and areas for enhancement.

· Lead initiatives to improve disaster recovery and business continuity plans.

· Stay updated with the latest industry trends and best practices in chaos engineering and operational acceptance.

Collaboration and Training:

· Educate and mentor team members on chaos engineering and operational acceptance methodologies.

· Foster a culture of resilience and reliability within the organization.

· Engage with external communities, attending conferences and participating in knowledge-sharing events.


Requirements


Extensive experience in chaos engineering, operational acceptance testing, and system resilience.

Strong understanding of cloud platforms (AWS, Azure, GCP) and their resilience features.

Proficiency in scripting and automation tools (Python, Bash, Terraform, etc.).

Experience with monitoring and observability tools (Prometheus, Grafana, Splunk, etc.).

Experience with Chaos Engineering Tools such as Gremlin, Chaos Monkey etc.,

Excellent analytical and problem-solving skills.

Strong communication and collaboration skills, with the ability to work effectively in cross-functional teams.

Certifications in relevant fields (e.g., AWS Certified Solutions Architect, Azure DevOps Engineer) are a plus.
Fulcrum Digital Inc

About Fulcrum Digital Inc

Founded in 1999, Fulcrum Digital is a global system integrator driving transformative results through AI-powered solutions. We implement, integrate, and grow ecosystems across industries, delivering high-quality outcomes at a fraction of the cost. Specializing in AI, Creative, Infrastructure, and e-commerce services, we empower businesses in Insurance, Financial Services, Government, Education, Manufacturing, Retail, and Logistics to thrive.

Our Core Platforms:

- FD Ryze: Our Generative AI SaaS solution that enables businesses to integrate LLMs, neuro readers, AI computing, and intelligent chatbots across various industries and functions. This integration enhances both internal workflows and customer-facing interactions by leveraging the full potential of artificial intelligence.

- FulcrumOne: A low-code platform with 130+ ready-made components, designed to simplify development and launch products 1.5x faster while cutting costs by up to 30%.

From financial services to education, consumer products to e-commerce, we work alongside you—digging deep into your challenges, helping you implement smarter strategies, and setting you up with platforms that solve today’s problems and are ready for tomorrow’s opportunities.

Let’s get started and see what we can build together.

Industry
IT & Software
Company Size
1,001-5,000 employees
Headquarters
New York, NY
Year Founded
1999
Social Media