Part-Time DevOps & Incident Response Engineer

Location: Remote – LATAM preferred
Seniority Level: Senior
Contract Type: Independent Contractor (part-time)

About RoverPass

RoverPass is a leading reservation management platform for RV parks and campgrounds across the United States. Our software enables campground owners to manage bookings, process payments, and handle guest communications in one streamlined system. The platform is built primarily in Ruby on Rails, with a React frontend, and hosted on DigitalOcean.

We operate a mature, stable product with a high degree of automation. While incidents are rare, having a reliable expert on call for system-level support is critical to ensuring continuity and performance.

Responsibilities

We’re looking for a DevOps Engineer with strong infrastructure and monitoring experience to support our team on a part-time, on-call basis. Your work will focus on infrastructure supervision, system reliability, and emergency response, not feature development.

You will:

- Monitor infrastructure and application health (DigitalOcean Droplets, background jobs, system resources)
- Respond to system outages, deployment failures, and other high-priority incidents
- Investigate logs, alerts, and performance metrics to identify root causes
- Perform basic infrastructure fixes, rollbacks, or patches when needed
- Ensure CI/CD pipelines (GitHub Actions + Capistrano) are running correctly
- Manage environment variables, SSL certs, Nginx configuration, and Redis/Sidekiq queues
- Collaborate with the development team to resolve technical blockers related to deployments or infrastructure
- Identify areas for performance, security, or reliability improvement when appropriate

Required Skills & Experience

- Proven experience deploying and supporting Ruby on Rails applications in production
- Strong understanding of DigitalOcean, Redis, PostgreSQL, and Linux server environments
- Experience configuring and managing CI/CD pipelines (GitHub Actions, CircleCI, etc.)
- Familiarity with reverse proxy setup, SSL certificates, and firewall configuration
- Experience with log analysis, system monitoring, and job queue management
- Availability to respond to incidents in a timely and structured manner
- Excellent communication skills and ability to work independently

Preferred Qualifications

- Experience with Capistrano, Terraform, or Docker
- Exposure to frontend hosting (React with CDN or Nginx)
- Prior work supporting SaaS or platform-based applications
- Based in LATAM or working within U.S. Central Time