Site Reliability Engineer

Remote, Costa Rica
Full Time
Mid Level
We are looking for a Site Reliability Engineer to maintain the WordFly application and ensure the best product version is presented to clients 24/7/365 days a year, anywhere in the world.    

WordFly is an innovative digital marketing tool for arts, culture & entertainment venues. This B2B email marketing SaaS product helps venues connect to their audience, sell tickets, engage customers, generate development funds, and more. It is used by some of the world’s greatest museums, sports & music venues, zoos and aquariums!

The Site Reliability Engineer will maintain a customer-facing application, APIs, and other services required to deliver infrastructure resources and manage tools supporting the WordFly product development team. They will lead the current implementation of a new SLA for the WordFly application of "4 nines," or 99.99% availability each year. This would require that our total downtime for the year be less than 52 minutes and 36 seconds each year. This position is part of our WordFly team and reports to the DevOps Manager.    

What You’ll Do    
  • Support and maintain a dynamic customer-facing application managing an optimal blend of code development, infrastructure engineering, and application security to ensure 24/7/365  global availability with an SLA of 99.99% per year
  • Build and configure templates for monitoring live applications, including database systems with tools with the Elastic stack, RedGate SQL Monitor, and PRTG
  • Build, configure, and test new infrastructure services using various technologies: Ansible, Python, Bash, PowerShell, C#, and T-SQL 
  • Deploy and manage continuous integration & deployment tools
  • Guide our development teams to keep new and existing features fast and stable
  • Work across development and operational workflows to resolve technical blockers with clear and transparent solutions
  • Maintain a repository of code created with clear technical documentation and ensure efficient communication across the team
  • Live and breathe the DevOps culture at POP

What We’re Looking For   
  • 5+ years of experience in supporting production environments with Internet-facing SaaS applications and leading efforts involving these technologies and disciplines: RESTful APIs, JSON, XML, OAuth2, HTTP, SSL/TLS, Webhooks, RabbitMQ and API authentication 
  • 5+ years deploying and supporting Windows and Linux servers, SaaS web applications, Mail servers, mobile apps
  • 5+ years of experience with one or more general-purpose programming languages including but not limited to C/C++, C#, Node.js, Python, JavaScript, PowerShell
  • Expert level of knowledge and experience in developing and managing API documentation and versioning systems
  • Strong debugging, troubleshooting, and problem-solving skills
  • Expert level of ability to collaborate with product owners, developers, engineers, support staff, and other global partners to create robust and reliable APIs
  • 5+ years of Hands-on experience with systems network troubleshooting with expertise in troubleshooting traffic issues across all layers of the OSI model and performing packet captures and performing the analysis (TCP/IP, VPN, DNS, SMTP, HTTP, Firewall rules, log reviews)
  • 5+ years of Hands-on experience with relational databases Microsoft SQL Server, Amazon RDS, and NoSQL databases
  • Experience with logging & monitoring tools: Elastic stack (Kibana, Logstash, Elasticsearch), NewRelic, PRTG Network Monitor, Graphite, Postman 
  • Hands-on experience with continuous integration and deployment automation tools (Octopus, Jenkins, Puppet, Docker, Ansible, and Chef) and source control revision tools (GIT, SVN, RCS, CVS). Specific experience with Jenkins pipelines is preferred
  • Experience working with Message Transfer Agent (MTA) apps, i.e., Postfix, Sendmail, high volume SMTP servers, etc., or as an exchange admin is desired
  • An excellent communicator in English. Both verbal communication (standups, sprint planning, incident investigation calls) and written communication (emails, slack, technical documentation)

What’s In It for You   
POP offers competitive compensation and full benefits. We pay 100% of the healthcare premiums and life insurance premiums for all our employees. You have the ability to work from home and additionally, each year, we offer generous paid time off and recognize individuals for 10 years of employment with a paid sabbatical - we believe in the importance of work/life balance!

POP is an Equal Opportunity Employer that is committed to an inclusive and diverse workplace. All qualified candidates will receive consideration for employment. POP will not discriminate on the basis of race, national origin, gender, gender identity or expression, sexual orientation, protected veteran status, physical or mental disability, age, or other legally protected status. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request to [email protected]

Back to current openings


Apply for this position

We've received your resume. Click here to update it.
Attach resume as .pdf, .doc, .docx, .odt, .txt, or .rtf (limit 5MB) or Paste resume

Paste your resume here or Attach resume file

Human Check*