SPD Technology is a place where everyone knows how to develop awesome software, does it well, and wants to do it better. We write more than code: we create solutions with business needs in mind. We want to be part of innovation, and to make that happen, we’re ready to learn and gain new expertise.
December 23, 2022

Site Reliability Engineer (vacancy inactive)

Kyiv, Lviv, Cherkasy, remote

SPD-Ukraine is looking for a Site Reliability Engineer to join our team.

As a Site Reliability Engineer in PitchBook’s engineering team, you will be creating and evolving systems to automatically run our suite of products and services reliably and consistently. As part of a team of site reliability engineers, you will help define service level objectives (SLOs) that determine success and build systems to achieve those objectives.

You will utilize your strong background in deploying, managing, and maintaining production systems, working with developers to operate and monitor large-scale services with complex distributed systems and data integrations. You will incorporate observability tools (monitoring, tracing, alerting), perform incident management, conduct root cause analyses, eliminate single points of failure, build reliability and redundancy into our infrastructure, establish and test our recoverability, mitigate failures, and do all of these things through automation and tools.

As an SRE, you will take independent responsibility for building and managing large subsets of our systems. You will help build our best practices for infrastructure-as-code, and your code will exemplify our quality controls. You will mentor and train other SREs, platform engineers, and software engineers in reliability topics.

Your ability to collaborate with colleagues, exhibit poise and adaptability in stressful situations, communicate effectively, and build resilient systems that can be consistently relied upon will be critical to your success. You will solicit feedback, learn constantly, engage others with empathy, and help create a culture of belonging, teamwork, and purpose.

A few words about SPD-Ukraine

At SPD-Ukraine, great people create great software. Our story started in 2006, and today a team of 500+ cool specialists continues to write it. Located in Ukraine (Kyiv, Cherkasy), we create products used by people all over the world.

Team: a Team Leader and 5 DevOps engineers.

Stack: GCP, Kubernetes, Jenkins, Java, Prometheus, ELK, Helm, PostgreSQL, RabbitMQ, Redis, MSSQL, Linux, Puppet, Terraform.

Scope of your responsibilities:

  • Establishes service level objectives (SLOs) and service level indicators (SLIs) as success criteria, and ensures that our systems and processes consistently meet or exceed these targets;
  • Builds recoverability into our services and systems, including disaster recovery (DR), backups/recovery, and incorporation of multi-AZ and multi-region design into cloud constructs;
  • Manages connectivity (CIDRs, VPCs, Subnets), latency, and availability across distributed systems;
  • Establishes clustering and load balancing techniques for high availability and scalability in containerized cloud-native environments;
  • Builds observability systems and services (monitoring, tracing) for reuse in our platform architecture, creating alerting for fault identification and building dashboards for metrics;
  • Operates and continuously improves our services’ reliability, scalability, performance, security, and uptime;
  • Learns constantly, including about available cloud-managed services (PaaS/SaaS/IaaS), libraries, frameworks, and platforms (commercial and open-source).

Required qualifications:

  • 5+ years of experience building and maintaining Windows/Linux/UNIX-based systems, primarily in cloud environments (preferably GCP);
  • 3+ years of experience coding in an object-oriented language, such as Java, Python, Go, or Kotlin;
  • 1+ years of experience with containers and orchestration platforms, including Kubernetes and Docker;
  • Deep knowledge of infrastructure systems, networking, and security, including in a cloud environment;
  • Experience owning operational reliability, scalability, recoverability (backups, disaster recovery, failover), and capacity planning;
  • Experience performing operational activities, including batch processing, system backups, maintenance, and monitoring, as well as providing first-tier on-call support as part of a 24×7 response team;
  • Experience with distributed, scalable microservices and event-driven architectures;
  • Experience with data storage, replication, caching, and search technologies, such as PostgreSQL, MS SQL Server, GCP CloudSQL, Redis, Elasticsearch, and Lucene/Solr.

Would be a plus

  • Holds at least one professional certification in GCP (DevOps or SysOps Engineer preferred).

We offer
— Financial support for your and your family’s relocation.
— Legal support of tax residence for team members outside Ukraine.
— Financial reimbursement of expenses for medical services outside Ukraine.
— Military leave status with 50% of your monthly compensation retained.
— 20 working days of annual paid vacation and sick leave.
— Educational support and financial reimbursement for language classes (Ukrainian, English, etc.).