About

We love our portfolio companies.

You’ll love working for one of them.

0
Companies
0
Jobs

Senior Software Engineer

Own Company

Own Company

Software Engineering
Mexico City, Mexico
Posted on Jan 19, 2026

Description

Senior Software Engineering

DET Team

Mexico City - Hybrid

Job responsibilities

Responsible for managing the day-to-day operations, ensuring product administration, platform reliability, and overseeing incident management and resolution processes. You will collaborate closely with engineering, product, and infrastructure teams to ensure the smooth functioning of systems and platforms and provide a high level of operational support to meet business goals.

  • Own incident resolution processes for L1 and L2 operations, ensuring timely and effective troubleshooting of technical issues.
  • Define and implement procedures for handling escalations and high-priority incidents.
  • Ensure root cause analysis is conducted for major incidents, and follow up on remediation actions.
  • Develop and enforce Service Level Agreements (SLAs) and Key Performance Indicators (KPIs) for platform performance and product support operations.
  • Monitor adherence to SLAs and manage escalations to maintain customer satisfaction.
  • Oversee the platform's operational stability and performance, ensuring high availability and scalability.
  • Monitor and manage platform performance metrics, proactively addressing any potential issues.
  • Ensure comprehensive documentation of operational procedures, troubleshooting guides, and runbooks for the L1/L2 support teams.
  • Track and create detailed operational reports and dashboards for tracking system health
  • Manage SaaS platform administrations and self-hosted applications in cloud environments like AWS.

Qualifications:

  • 6+ years of experience in IT
  • Strong understanding of IT infrastructure, cloud platforms, and operational best practices.
  • Strong experience on Docker, Kubernetes & Helm along with any programming language (Java preferred) experience to support platform KLO & monitoring
  • Proven experience with incident management, service management, and driving process improvements.
  • Expertise in monitoring tools, automation frameworks, and platform performance optimization.
  • Strong understanding of networking fundamentals (TCP/IP, DNS, load balancing, etc) and authentication/authorization mechanisms (OAuth 2.0, SAML, JWT etc)

Technical Stack

  • Cloud: AWS (EC2, EKS, RDS, S3, Lambda, CloudWatch, VPC, IAM, Route53)
  • Containers: Docker, Kubernetes (EKS), Helm, containerd
  • Monitoring: Prometheus, Grafana, Datadog, CloudWatch, New Relic, ELK Stack
  • CI/CD: Jenkins, GitLab CI, GitHub Actions, ArgoCD, Flux
  • Languages: Python,Java, Bash/Shell
  • Databases: PostgreSQL, MySQL, Redis,Elastic Search
  • Tools: Git, Vault,CyberArk, AWS Secrets Manager, PagerDuty
  • IaC: Terraform, CloudFormation, Ansible, Pulumi
  • Auth: OAuth 2.0, SAML, JWT, AD, AWS IAM, RBAC