• Experience with Python 3.x;
• Basic knowledge of TensorFlow/PyTorch, Gensim, scikit-learn, Keras, spaCy, NumPy, and similar ML libraries;
• Basic understanding of how message brokers, CI pipelines, and shell scripting work;
• Basic understanding of statistics and probability theory;
• Basic understanding of, and hands-on experience implementing, basic NLP approaches;
• Strong understanding of business requirements;
• Desire to participate in the end-to-end delivery cycle, which includes both high-level and low-level tasks (e.g. regex development and data labeling);
• Experience with deep learning.
• Basic knowledge of Java;
• Experience with Docker containers and services;
• Advanced knowledge of English.
• Competitive compensation depending on experience and skills;
• Opportunities for self-realization, professional and career growth;
• Office near the Teatral’na metro station (Bohdana Khmelnytskogo Str.);
• Compensation package (paid vacation and sick leave) and flexible working hours;
• Study and practice of English: courses and communication with colleagues and clients from different countries;
• Yoga classes and football.
• Research into the latest advances in NLP to continuously deliver new approaches and support existing pipelines;
• Integration testing, unit testing, and benchmarking of the code;
• Assistance with all steps of the ML cycle: feature engineering, model development, model training, model evaluation and prediction, pipeline integration and deployment;
• Performing all data preparation steps: data fetching, preprocessing, labeling, and cleaning;
• Assistance with the creation of Docker containers, installation scripts, GitLab CI, etc.;
• Development and support of a rule-based structured data processing system;
• Creation of basic documentation for the repo.
We are a company working on a privacy control and management solution that lets enterprises ensure GDPR compliance. It delivers efficient, up-to-date business intelligence for data protection in enterprises. 1touch.io’s purpose is to build data privacy technology that provides continuous visibility into your organization’s use of personally identifiable information (PII), whether it is known or unknown, structured or unstructured, in motion or at rest.
We are looking for an ambitious, highly motivated junior NLP scientist who is willing to participate in the development of an AI module responsible for end-to-end processing of any kind of text data that comes into the application.