This is a remote position.
Embrace Legal Group is a trusted legal technology solutions provider within the Embrace portfolio of companies. JST, also part of the Embrace portfolio, is a pioneering provider of debt collection and legal case management software. For over 30 years, JST has helped collection law firms, in-house collection departments, debt buyers, and collection agencies overcome inefficiencies in managing their accounts and compliance obligations, enabling them to reduce costs, improve recovery rates, and scale operations with confidence.
Our enterprise-grade, multi-tenant SaaS platform serves hundreds of organizations across the legal collections landscape, where uptime, data security, regulatory compliance, and operational reliability are non-negotiable.
The Opportunity
We are seeking a highly capable Platform Engineer to join our Platform Engineering team and help build, operate, and scale the infrastructure that powers JST’s legal technology platform.
In this role, you will own critical areas of our cloud infrastructure, container orchestration platform, CI/CD systems, observability stack, and operational reliability practices. You will work closely with product engineering, security, and data teams to create resilient, secure, observable, and cost-efficient systems that enable rapid software delivery while maintaining the reliability and compliance standards expected by enterprise clients.
This is a high-impact opportunity for an engineer who thrives at the intersection of cloud infrastructure, automation, developer enablement, and operational excellence in a regulated, multi-tenant SaaS environment.
Key Responsibilities
Cloud Infrastructure & Platform Architecture
Own and evolve the AWS infrastructure that underpins our multi-tenant SaaS platform.
Design, provision, and manage production-grade AWS services including EC2, S3, RDS, ECR, VPC, IAM, CloudFront, Route 53, and EKS/ECS clusters.
Implement and maintain Infrastructure as Code (IaC) using Terraform or CloudFormation to ensure repeatable, version-controlled, and auditable environments across development, staging, and production.
Architect and optimize PostgreSQL infrastructure including automated backups, replication, failover strategies, and performance tuning for high-throughput transactional workloads.
Drive high availability, disaster recovery planning, scalability, and cloud cost optimization initiatives across the platform.
Contribute to infrastructure standards, platform governance, and operational best practices.
CI/CD & Release Engineering
Build and maintain delivery pipelines that enable rapid, safe, and reliable deployments.
Design and operate CI/CD workflows for Python (Django/Flask/FastAPI) and React applications across multiple services.
Automate build, test, deployment, and rollback workflows using GitHub Actions, GitLab CI, Jenkins, or equivalent tooling.
Implement deployment strategies including blue-green, canary, and rolling deployments to reduce production risk.
Manage artifact repositories, container registries (ECR), and deployment manifests with full traceability and rollback support.
Improve developer workflows and deployment automation to increase engineering velocity and platform reliability.
Container Platform & Orchestration
Design, operate, and optimize our container orchestration platform for scalability, reliability, and tenant isolation.
Manage Docker-based development and production environments, including image hardening and registry governance.
Implement and maintain Kubernetes (EKS) or ECS infrastructure for scalable application deployments.
Define and maintain Helm charts, Kubernetes manifests, and environment-specific deployment configurations.
Enforce networking policies, namespace isolation, resource quotas, and workload security standards.
Support platform scalability, cluster health, autoscaling, and operational resilience.
Observability, Reliability & Security
Build and maintain monitoring, alerting, and observability systems using CloudWatch, Datadog, Prometheus, Grafana, or similar tooling.
Implement centralized logging and audit trail solutions across application and infrastructure layers.
Define operational standards for incident response, alerting, reliability, and system health monitoring.
Enforce infrastructure security best practices including secrets management, IAM least-privilege access, network segmentation, and certificate management.
Support compliance initiatives including SOC 2 and HIPAA through infrastructure controls, audit readiness, and vulnerability management.
Lead incident response, root cause analysis, and blameless postmortem reviews.
Cross-Functional Collaboration & Platform Enablement
Partner with engineering teams to improve deployment reliability, operational efficiency, and developer experience.
Troubleshoot infrastructure, deployment, networking, and performance issues across environments.
Author and maintain infrastructure documentation, architecture diagrams, operational runbooks, and deployment playbooks.
Mentor team members on platform engineering, infrastructure-as-code practices, operational excellence, and cloud-native tooling.
Contribute to long-term platform scalability, automation, and engineering enablement initiatives.