BS/MS degree in Computer Science or equivalent experience
Strong development skills in Java
Experience using Git (or another DVCS) to manage parallel development on a large codebase
Working knowledge of Apache Spark
Experience working with distributed systems and with streaming and event-processing products
Experience setting up and maintaining Linux systems on physical hardware as well as in virtual or cloud infrastructure
Strong written and verbal communication skills
Experience with RDBMS and NoSQL database administration, data architecture, data modeling, and implementation
Working knowledge of the Hadoop stack: Hadoop (YARN, Hive, Impala, and other components would be a plus)
Bash scripting experience
• Company with 21+ years in business
• Medical insurance
• Flexible work schedule
• 24 calendar days of vacation
• 5 sick leave days
• Free parking
• Friendly team of 150+ people
• Excellent location: Podil, Verkhniy Val 4a str.
The Unified Integration Platform for Big Data applications is an Apache 2.0-licensed, distributed application framework for delivering Hadoop solutions. It integrates and abstracts the underlying Hadoop technologies to provide simple, easy-to-use APIs and a graphical UI for building, deploying, and managing complex data analytics applications in the cloud or on-premises.
It provides simple, easy-to-use abstractions for processing and analyzing huge datasets without having to write and debug low-level code, leading to rapid time-to-value. It integrates the latest Big Data technologies, from data integration, discovery, and data science to app development, security, operations, and governance.