Embrace Software Inc

Platform Engineer (AWS)

Embrace Software Inc  •  Remote  •  2 days ago
Apply
AI can make mistakes so check important info. Chat history is never stored.

Job Description


This is a remote position.


Embrace Legal Group is a trusted legal technology solutions provider within the Embrace portfolio of companies. JST is a pioneering legal technology solutions provider within the Embrace portfolio of companies, specializing in debt collection and legal case management software. For over 30 years, JST has helped collection law firms, in-house collection departments, debt buyers, and collection agencies overcome inefficiencies in managing their accounts and compliance obligations, enabling them to reduce costs, improve recovery rates, and scale operations with confidence.


Our enterprise-grade, multi-tenant SaaS platform serves hundreds of organizations across the legal collections landscape, where uptime, data security, regulatory compliance, and operational reliability are non-negotiable.


The Opportunity


We are seeking a highly capable Platform Engineer to join our Platform Engineering team and help build, operate, and scale the infrastructure that powers JST’s legal technology platform.


In this role, you will own critical areas of our cloud infrastructure, container orchestration platform, CI/CD systems, observability stack, and operational reliability practices. You will work closely with product engineering, security, and data teams to create resilient, secure, observable, and cost-efficient systems that enable rapid software delivery while maintaining the reliability and compliance standards expected by enterprise clients.


This is a high-impact opportunity for an engineer who thrives at the intersection of cloud infrastructure, automation, developer enablement, and operational excellence in a regulated, multi-tenant SaaS environment.


Key Responsibilities


Cloud Infrastructure & Platform Architecture


  • Own and evolve the AWS infrastructure that underpins our multi-tenant SaaS platform.

  • Design, provision, and manage production-grade AWS services including EC2, S3, RDS, ECR, VPC, IAM, CloudFront, Route 53, and EKS/ECS clusters.

  • Implement and maintain Infrastructure as Code (IaC) using Terraform or CloudFormation to ensure repeatable, version-controlled, and auditable environments across development, staging, and production.

  • Architect and optimize PostgreSQL infrastructure including automated backups, replication, failover strategies, and performance tuning for high-throughput transactional workloads.

  • Drive high availability, disaster recovery planning, scalability, and cloud cost optimization initiatives across the platform.

  • Contribute to infrastructure standards, platform governance, and operational best practices.


CI/CD & Release Engineering


  • Build and maintain delivery pipelines that enable rapid, safe, and reliable deployments.

  • Design and operate CI/CD workflows for Python (Django/Flask/FastAPI) and React applications across multiple services.

  • Automate build, test, deployment, and rollback workflows using GitHub Actions, GitLab CI, Jenkins, or equivalent tooling.

  • Implement deployment strategies including blue-green, canary, and rolling deployments to reduce production risk.

  • Manage artifact repositories, container registries (ECR), and deployment manifests with full traceability and rollback support.

  • Improve developer workflows and deployment automation to increase engineering velocity and platform reliability.


Container Platform & Orchestration


  • Design, operate, and optimize our container orchestration platform for scalability, reliability, and tenant isolation.

  • Manage Docker-based development and production environments, including image hardening and registry governance.

  • Implement and maintain Kubernetes (EKS) or ECS infrastructure for scalable application deployments.

  • Define and maintain Helm charts, Kubernetes manifests, and environment-specific deployment configurations.

  • Enforce networking policies, namespace isolation, resource quotas, and workload security standards.

  • Support platform scalability, cluster health, autoscaling, and operational resilience.


Observability, Reliability & Security


  • Build and maintain monitoring, alerting, and observability systems using CloudWatch, Datadog, Prometheus, Grafana, or similar tooling.

  • Implement centralized logging and audit trail solutions across application and infrastructure layers.

  • Define operational standards for incident response, alerting, reliability, and system health monitoring.

  • Enforce infrastructure security best practices including secrets management, IAM least-privilege access, network segmentation, and certificate management.

  • Support compliance initiatives including SOC 2 and HIPAA through infrastructure controls, audit readiness, and vulnerability management.

  • Lead incident response, root cause analysis, and blameless postmortem reviews.


Cross-Functional Collaboration & Platform Enablement


  • Partner with engineering teams to improve deployment reliability, operational efficiency, and developer experience.

  • Troubleshoot infrastructure, deployment, networking, and performance issues across environments.

  • Author and maintain infrastructure documentation, architecture diagrams, operational runbooks, and deployment playbooks.

  • Mentor team members on platform engineering, infrastructure-as-code practices, operational excellence, and cloud-native tooling.

  • Contribute to long-term platform scalability, automation, and engineering enablement initiatives.


Requirements


Must-Have Skills


  • 5+ years of progressive DevOps/SRE experience in SaaS or enterprise environments.

  • Infrastructure as Code using Terraform (AWS provider, modules, multi-environment state management).

  • AWS core services: EKS, ECR, RDS, VPC, IAM, CloudWatch, ALB, EFS, S3, CloudFront, Route 53.

  • Kubernetes administration: Helm charts, pods, deployments, services, kubectl, autoscaling.

  • Docker containerization including multi-stage builds and registry operations.

  • CI/CD pipelines: AWS CodeBuild, GitHub Actions, GitLab CI, or Jenkins.

  • PostgreSQL production management: backup automation, replication, monitoring, performance tuning.

  • Linux systems administration (Ubuntu/Amazon Linux) and shell scripting proficiency.

  • Networking fundamentals: DNS, load balancing, TLS/SSL, firewall rules, VPN configurations.

  • Monitoring and observability: Datadog, FluentBit, CloudWatch Logs.

  • Security: AWS Secrets Manager, ACM certificates, security groups, IAM policies.

  • Application stack: Django, Celery, Redis, PostgreSQL, Nginx.

  • Git workflows, branching strategies, and pull request review processes.

  • Strong problem-solving skills with a proactive, ownership-driven approach.


Good-to-Have Skills


  • Advanced AWS services: AWS Backup, Lambda, SNS, EventBridge.

  • Advanced Kubernetes: EFS CSI driver, AWS Load Balancer Controller, Cluster Autoscaler.

  • Python scripting for infrastructure automation and operational workflows.

  • Multi-tenant SaaS architecture, tenant isolation strategies, and data partitioning.

  • Third-party service integration (SendGrid, Twilio) at the infrastructure level.

  • FinOps practices: cloud cost management, reserved/spot instance optimization.

  • Compliance frameworks (SOC 2 Type II, HIPAA) and required infrastructure controls.

  • Service mesh technologies (Istio, Linkerd) or API gateway solutions.

  • Cluster management tools like Rancher.

  • Database disaster recovery: snapshots, cloning, multi-region considerations.

  • Container security scanning and ClamAV integration.

  • Infrastructure documentation and multi-environment workflows (dev → stg → prod).

  • AWS certifications (Solutions Architect, DevOps Engineer Professional).


Benefits


  • Competitive salary commensurate with experience.

  • Opportunities for career advancement and professional development.

  • Experience collaborating with a diverse, global team within a remote work setting.
Embrace Software Inc

About Embrace Software Inc

Embrace acquires and invests in niche software that provides industry-specific solutions. We believe best practices followed by enterprise software are equally relevant for small software businesses. Access to capital and know-how enables our acquired companies to serve their customers better and grow their businesses faster.

Industry
IT & Software
Company Size
51-200 employees
Headquarters
Tampa, Florida
Year Founded
2021
Social Media