— Experience in advanced support of web/internet applications as NOC/SOC/SRE engineer;
— Understanding of Business KPIs;
— Scripting experience (Python/Bash/Other);
— Comfortable briefing and reporting to senior executives and clients;
— Familiar with Cloud (i.e. AWS, GCP) network architectures and the world of the Internet;
— Previous experience with Logging, Monitoring, and Management systems (e.g. AWS CloudWatch, Google StackDriver, SignalFx, DataDog, ELK,New Relic);
— Experience with SQL (preferably Google BigQuery);
— Good analytical and technical troubleshooting skills;
— High English proficiency (verbal and written);
— Previous managerial experience.
— Generous compensation with regular performance reviews;
— 20 days of paid vacation and unlimited sick leaves;
— Comprehensive medical insurance for you and your family member free of charge;
— Sports expenses reimbursement;
— Comfortable office in BC Gulliver;
— Free daily lunches in the office and fully stocked kitchen with the greatest coffee;
— Newest technical equipment (macOS);
— Training & Development / Tuition reimbursement; online courses of your choice;
— Parental leave;
— Employee Referral Program with great bonuses;
— Regular team buildings and Company Happy Hours;
— Relocation bonus for nonlocal candidates;
— Car parking paid by the Company.
— A global data-driven company, with a unique product and strong R&D center;
— Exceptional innovative and dynamic work environment;
— Promote transparency & open employee communication;
— Tremendous growth & career advancement opportunities;
— Encourage, support, and empower learning exploration and career development opportunities;
— Directly impact and build personalized product experiences for our players.
And, of course, we like having fun! We celebrate our significant days and never forget about gifts! Holidays, parties, Happy hours, and all kinds of entertaining events — brought to us by our amazing Employee Experience Expert :) Join us on the Moon!
— Establish and maintain monitoring for new product;
— Handle follow-ups and retrospective for production related incidents and tasks;
— Recruit, mentor and train SREs;
— Regular pro-active review and tuning of monitoring systems based on business needs and production incidents;
— Regular review and updating of operational delivery processes for the SRE Team;
— Act as 1st and 2nd tier infrastructure and application support;
— Gathering information from different sources and then cross-referencing it in order to attain a resolution to production incidents;
— Build cooperation and workflow with different teams such as: Product, R&D, DevOps and Support teams to escalate, troubleshoot and resolve complex issues;
— Ensure proper documentation is provided for all supported SRE activities and standards;
— Conduct Proof Of Concepts for new SRE tools and technologies.
As a Lead, you will support production activities to sustain our platforms & tools, troubleshoot, develop, maintain and document technical solutions related to Moon Active’s production infrastructure.
This position requires hands-on technical work as well as good analytical and leadership skills. As a Lead, you will work with the team and management to produce and update proper procedures for the SRE, train SRE engineers, and act as the first level of contact for leadership escalations.