Wirex is a British-based FinTech company with R&D in Kyiv, Ukraine. Leading a global market of hybrid personal funds, Wirex can currently boast having engaged over 2 million users and performed over 2 billion transactions.
29 июля 2020

Lead System Reliability Engineer

Киев

Are you a professional Lead System Reliability Engineer who’s looking for new challenges? Then join our development team in Kyiv to build innovative fintech products. Here you will be the part of a cutting edge company that truly recognizes and values the contribution to product development of each teammate.

Who we are:

We’re a FinTech company based in the UK, with an extensive R&D center in Kyiv and offices around the world (Atlanta, Toronto, Tokyo, and Singapore).
We are the first payment platform to seamlessly integrate digital and traditional currencies and to support multi-currency accounts, blockchain-powered cross-border transfers, and exchange services.
Our mission is to give everyone the power to use one single global platform for traditional financial and digital assets from anywhere in the world.
We have more than 3 mln users in 130 countries and we’re constantly expanding. It’s an exciting time to get on board!

Your responsibilities will be:

— Develop and manage multiple teams of Reliability Engineers. Capacity planning, scheduling.
— Lead the development of department culture, processes, procedures, technologies.
— Lead global initiatives and develop Reliability Strategy, to achieve excellence in system availability and product lifecycle management process.
— Contribute in developing of all global projects/features in product before they go live through system design consulting, capacity planning, monitoring development, and launch reviews.
— Lead in managing of lifecycle for services once they are live by measuring and monitoring availability, latency and overall system health across our cloud-hosted infrastructure.
— Take part in scaling of systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
— Design and build systems to provide real-time operational insight for development and management teams.
— Partner with development and engineering teams and leadership in it to promote best practices and provide advice on how to implement features that are instrumented and observable.
— Generate, manage, and report the application performance data captured by the monitoring tools and proactively work with DevOps teams in resolving performance issues.

TODOs:

— Team setting up and management.
— Reliability strategy planning.
— Monitoring and reaction tools: setting up, automation, assigning.
— Autoscaling.
— Logging infrastructure.
— Servers/Network security audit.
— Upgrade existing provisioning tool.
— Improve monitoring.
— Improve disaster recovery procedures.
— On-demand environments to test a single feature.
— Deploy improvements: more security, deeper integrations with CI, automation, data sharding, etc.

Requirements:

— 3+ years of IT operations/IT monitoring/DevOps.
— Experience as a Team Lead.
— Knowledge and familiarity with alerts & monitoring tools, and system management tools (likely but not limited by Grafana, Prometheus).
— Knowledge and familiarity with logs collection and analyze systems like Splunk, ELK.
— Experience in monitoring virtual and on-premises infrastructures.
— Ability to hold lots of interactions and troubleshoot highly complex error conditions.
— Desire to work in a global company with HL distributed product.
— Development of the SRE team.
— Intermediate+ English.

Nice to have:

— Knowledge and familiarity with configuration management tools including Ansible, Chef or Puppet.
— Experience in designing, monitoring, analyzing and troubleshooting distributed systems.
— Experience in monitoring complex business applications.
— Excellent communication and problem-solving skills, with the ability to drive and work in a fast-paced environment.
— Knowledge in IT process automation.
— Experience with Cloud-Based high load products.
— Experience with Azure Services (Web Apps, cloud services, VMs, Service Fabric), latest monitoring tools and services. Familiarity with setting up them and using in production.
— Experience in monitoring financial operations.

Being a member of the Wirex team means:

— Flexible office hours.
— Medical insurance and paid sick leave.
— 20 business days of paid holiday a year.
— Brand-new equipment.
— Paid state holidays and paid days on special occasions.
— Fast career development.
— PE accounting and support.
— Free parking.

LinkedIn