Sphere Software is a global provider of high-quality software development, testing and consulting services. We are passionate about bringing the best commercial software to market, helping start-up businesses and Fortune 500 enterprises. Clients rely on our solutions to drive business growth and customer satisfaction.
6 лютого 2024

Site Reliability Engineer, Sr (вакансія неактивна)

віддалено

Sphere partners with Clients to transform their organizations, embed technology and processes into everything they do and enable lasting competitive advantage. We combine global expertise and local insight to help people and companies turn their ambitious goals into reality. At Sphere, we put people first and strive to be a changemaker by building a better future through innovation and technology. Sphere is helping a multinational company to innovate and bring the product to market and is looking for a Site Reliability Engineer, Sr. who will join the team on a full-time basis.

Location: Remotely
Type: Full-time
Start Date: ASAP

Responsibilities include but are not limited to:

  • Run the production environment by monitoring availability and taking a holistic view of system health;
  • Build software and systems to manage platform infrastructure and applications;
  • Improve reliability, quality, and time-to-market of our suite of software solutions;
  • Make monitoring and alerting alert on symptoms and not on outages;
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement;
  • Provide primary operational support and engineering for multiple large-scale distributed software applications;
  • Practice sustainable incident response as well as participate in blameless postmortems. Understand applicable company policies, procedures and other job-specific instructive documents and materials;
  • Conduct business and perform job duties in a manner consistent with the requirements set forth in all company policies, procedures and other directives, and in compliance with legal and regulatory requirements;
  • Complete all compliance training assigned to them to understand the key provisions of law, regulation and internal policies and procedures applicable to their job duties, as well as the impact of non-compliance on the company’s reputation and success;
  • Raise concerns about any practice(s) believed to be a violation of, or inconsistent with, company policies, procedures or other directives, or in violation of legal or regulatory requirements;
  • Monitor processes and procedures to ensure safety and compliance;

Requirements:

  • Strong experience in maintaining AWS cloud infrastructure;
  • Experience with infrastructure automation and container orchestration tools — Docker, Kubernetes, Terraform, Helm etc.;
  • Knowledge on any one of — Python, Shell, Go or Powershell;
  • Strong debugging/troubleshooting skills;
  • Deep working knowledge on Windows/Linux servers and networking;
  • Experience with monitoring/logging solutions like DataDog, ELK, SignalFx, Prometheus;
  • Monitoring and instrumentation: implement metrics in Prometheus, Grafana, log management and related system, and Slack/PagerDuty integrations;
  • Engineering practices: availability, reliability and scalability, as well as disaster recovery;
  • Proactive approach to identifying problems, performance bottlenecks, and areas for improvement;
  • Familiarity with continuous integration and deployment tools like Gitlab CI/Argo workflow/Argo CD;
  • Experience with modern cloud development practices (microservices architectures, REST interfaces, etc.);
  • Always ready to learn more and adopt new cutting edge technology with the right value proposition;
  • Strong written and verbal communication skills;
  • Ability to work independently and as part of a team.

Sphere offers a competitive and rewarding salary and benefits package, as well as an intellectually and creatively stimulating work environment and flexibility.