— Production experience with Kubernetes, both managed and self-managed.
— Experience implementing Continuous Integration / Continuous Delivery with ArgoCD or Jenkins.
— Experience deploying and managing observability tools such as Prometheus, Grafana, and the ELK stack.
— Solid AWS experience; experience with GCP is a plus.
— A desire to write tools and applications to automate work.
— Experience in engineering highly scalable and distributed systems.
— Familiarity with bare-metal bootstrapping, provisioning, configuration, and orchestration is a plus.
— Comfortable with Go, Python, Bash scripting, etc.
— Strong written and verbal communication skills.
— Self-directed, analytical, and able to work well in a team environment.
— Passionate about the Restream product.
— A geeky, data-driven team that is always on the lookout for the most innovative technologies and processes available.
— The growth journey of a startup, shared with our distributed team.
— A culture of trust, transparency, and independence.
— Improve and maintain the Restream infrastructure platform-as-a-service used by our software engineers.
— Maintain and improve the Continuous Integration / Continuous Deployment development workflow.
— Adapt modern tools and frameworks to achieve proactive monitoring at both the infrastructure and application levels.
— Work closely with software engineering teams to maintain a four-nines (99.99%) SLA.
— Take part in architecture discussions and knowledge sharing with software engineers.
— Maintain a pulse on emerging technologies and discover hidden opportunities in our environment.
Restream is looking for a talented Site Reliability Engineer who is passionate about continuous integration, continuous delivery, and high availability, as well as the product that we are building together. Join a fast-moving team working hard to build, improve, and maintain the Restream infrastructure platform and keep Restream running as we grow (and we grow fast!) while delivering amazing new products and features.