— Solid skills in Linux administration
— Experience and understanding of web technologies (REST, cloud-based applications)
— Experience with Logging and Management Systems (AWS Cloudwatch, ELK)
— Understanding of monitoring principles and experience in using of application & infrastructure monitoring tools (such as NewRelic, Zabbix, Prometheus, Grafana, ELK)
— Good analytical and technical troubleshooting skills and Linux & Windows performance issues
— Experience with SQL
— Experience with cloud-based infrastructure (AWS, GCP, Azure)
— Experience with VMware or other virtualization technology
— Experience with at least one of the programming or scripting languages (preferably Python, Bash, GO)
— Hands-on network administration experience: TCP / IP stack, HTTP, DNS, Load-balancers, PKI (TLS), network security, etc.
— Windows administration experience
— Experience with Docker and Docker orchestration systems (like a “Kubernetes”, “Docker Swarm”, etc.)
— Understanding or experience with CI / CD
— Understanding principles of IaC (we use Terraform)
— Proficient understanding of Git
— High & competitive salary
— Challenging work in an international professional environment
— Opportunity to influence software development process, to be the owner of the product in your field of expertise
— Opportunity to apply SAFe methodology
— Flexible management
— Flexible / Casual Leave
— Relocation Bonus when moving from a different city / country
— Full benefits package: paid vacation and sick leave
— Continuous professional development (free internal and external professional trainings)
— Free English classes in the company office
— Free use of the services provided by Namecheap
— Quarterly teambuilding activities and company corporate events
— RDX gym membership
— Coffee, tea, fruits
— Help with automation and DevOps processes
— Analyze and improve the availability, latency, performance, and efficiency of the applications
— Own the day-to-day health, uptime, monitoring, and reliability of services and server infrastructure.
— Capacity planning and provisioning
— Consult in areas of reliability and scalability for the development of new applications.
— Work together with teams in other departments to find solutions
— Root Cause Analysis for all production outages
— Improve monitoring and alerting systems
— Hardware and software monitoring
— The position requires 24×7 support rotation with other team members — work with servers and services is required (NO customer support or interaction with end users / customers is expected)
Namecheap offers domain names at some of the best prices in the industry, along with fully featured hosting packages, secure SSL certificates, WhoisGuard privacy protection service, and more. We work hard to provide unparalleled levels of service, security, and support. We strive to offer intuitive products at the most competitive prices in the business.
Now we are looking for Site Reliability Engineer, the guy who will manage and mitigate problems on Linux and Windows servers and monitor the health condition of services. This candidate will be someone who lives and breaths production and sandbox environments and does tasks with best-known practices.