We are looking for an L2 Engineer to join our team of professionals. Our ideal candidate is motivated and interested in product development and modern technology. We follow processes that focus on scaling and development while maintaining high product quality.
Our project is a multi-component, multi-role health care system that processes patient-state data, predicts health risks using an AI model, and acts as a broker between patients and care coordinators. It was developed over eight clinical trials at the UCLA hospital system, culminating in seven patents.
The system has multiple integrations, uses a range of modern tools and technologies, and supports a variety of use cases and scenarios thanks to its extensive configuration options.
Requirements:
- Work schedule in the EST time zone
- 4+ years of experience in software production support, troubleshooting, log analysis (Sentry, AWS CloudWatch, Pingdom), and security monitoring
- Understanding of Continuous Integration processes and experience with CI/CD tools
- Fluent English and good communication skills
- Strong analytical and problem-solving skills
- High motivation and ability to work in a distributed team
Technologies Used:
- Relational DB (MySQL) / NoSQL DB (MongoDB)
- Docker containers
- AWS services (EC2, ECS, Aurora RDS, S3, CloudWatch, etc.)
- Sentry
- Pingdom
- REST API
- Git
- Bash
Nice to have:
- Knowledge of Python and the Django framework
- Experience in performance testing
- DevOps skills / Terraform
Responsibilities:
- Troubleshoot the system to resolve problems quickly (log analysis, restarting services, isolating problems, finding workarounds, preparing DB scripts for data fixes)
- Collaborate with our Level 1 support person(s)
- Continuously study the system to understand the flows of all users
- Deploy the application to testing environments
- Find and trigger all the failure modes in the system environment
- Find and trigger all the failure modes in user flow scenarios
- Investigate potential data breach scenarios and recovery options
- Restore database backups
- Build a new environment and reroute existing URLs to it
- Continuously monitor security issues in the AWS environment
- Assist with UAT and documentation of new enhancements and bug fixes
- Contribute content to the troubleshooting guide
- Provide monthly reports