PX is a platform for all of your Customer Acquisition needs, that brings buyers and sellers together in programmatic marketplaces. The platform optimizes performance, generates deep insights, and gives you greater control over your marketing programs through effective lead, call and appointment campaigns. Our platform offers products for both marketers and publishers, and we span tens of verticals across the Home Services, Financial Services, and Insurance industries.
We are a team of unique, quirky, hard-working, and innovative people working across four international locations, with lofty goals and down to earth personalities.
Who you are:
We are looking for a self-driven Site Reliability engineer who will join a fast-growing international company and be responsible for implementing best practices on environment infrastructure reliability.
- Support & develop products practices for high availability, scalability, reliability and performance
- Manage monitoring tools, alerts and dashboards to provide visibility into system health and performance
- Proactively identify and resolve performance bottlenecks or availability issues
- Lead the post-incident analyses to identify root causes and implement preventive measures to avoid future incidents
- Automate repetitive tasks and processes to improve efficiency and reduce manual intervention
- Mitigate broken systems and prevent them from causing future disruptions
- Create and maintain documentation for platform configuration, and troubleshooting procedures
- Help to perform capacity planning and resource allocation to ensure optimal system performance and scalability
- Collaborate with development teams to implement and deploy new features and enhancements, ensuring they meet reliability and performance standards
- Manage staging environments
What you have:
- Knowledge of Azure Cloud & Azure DevOps (or other cloud)
- Proficiency in scripting languages such as PowerShell/TypeScript/Bash
- Expertise in monitoring and logging tools such as Azure AppInsights/Grafana/Splunk.
- Knowledge of C++/C# is highly preferred.
- Experience with configuration management tools
- Understanding of networking principles and protocols (TCP/IP, HTTP, DNS, etc.).
- Knowledge of containerization technologies (Docker, Kubernetes) and orchestration tools is preferred.
Who you are:
- Bachelor’s degree in computer science, engineering, or a related field.
- Proven experience as a Site Reliability Engineer or a similar role.
- Solid understanding of software development methodologies and DevOps principles.
- Experience with agile and iterative development processes.
- Certification in relevant technologies or frameworks is a plus
- Familiarity with continuous integration/continuous deployment (CI/CD) pipelines.
- Experience with source control systems such as Git.
- Knowledge of security best practices and experience implementing security measures in a production environment.
- Strong analytical and problem-solving skills, with a focus on continuous improvement and automation.
Perks:
- Dedicated, motivated, high-energy team with a will to win in their industry
- Cross-functional teams in New York, The Netherlands, Panama, and Ukraine
- Career Development, Learning, and Training Opportunities