Вакансия Senior Data Engineer / Python

15 июля 2020    122

About Us:

IHS Markit is a global information company that provides expertise, data and solutions to 50,000+ customers helping them in making more informed decisions. Our research and development center in Minsk focuses on creation of three intellectual platforms of IHS Markit related to engineering and manufacturing domains. This includes scalable cognitive engines that help users – engineers, innovators and researchers – to discover and leverage knowledge locked in corporate repositories as well as in industry sources.  

The Minsk AI team is looking for new talents for the role of Senior Data Engineer critical for success of Data Science and Machine Learning/Deep Learning projects related to Natural Language Processing, Data Capturing, Content Understanding and Information Retrieval domains.  

Your Role:

You will be responsible for design and engineering aspects of innovative projects related to automatic structuring and understanding of unstructured content. Your role is needed to envision and build robust data extraction, processing and transformation pipelines, efficiently apply intelligent models, as well as to curate all questions related to data life cycle. 

Your duties will include: 

  • Owning the vision and development of data engineering aspects of the projects with Deep Learning, Natural Language Processing, Data Capturing and/or Information Retrieval  

  • Working in the team with data scientists, ML engineers and developers on building the intelligent capabilities into company products 

  • Responsibility for research in the ETL and data processing technologies, as well as tools around deployment of ML-based components to production pipeline 

  • Hunting for data to empower data-driven development: finding and scraping data sources for experimental setups, transforming, cleaning and curating the datasets 

  • Designing and implementing data processing pipelines (designing ETL system for ML/DL projects, designing online leaning loops, embedding active learning algorithms into data annotation toolset, etc.) 

  • Organizing data warehousing, storage and versioning (make sure ML experiments are repeatable and keep track from data state)  

About You:

You are a talented engineer who is addicted to complex and fuzzy challenges. Your required qualifications and experience include: 

  • Strong coding and software engineering skills 

  • Ability to make good design decisions related to data 

  • Python programming experience 4+ years  

  • Experience with textual data engineering (encoding, formats, tools)  

  • Developed skills in algorithms and data structures 

  • Experience with data processing automation, schedulers and pipeline tools (Airflow, NiFi, Beam, make, etc.) 

  • Advanced Linux experience (Bash, CL tools) 

  • English language (B1+) 

The following will increase our interest: 

  • Experience with programming in C++ 

  • Skills and knowledge in math and statistics 

  • Experience with SQL, NoSQL and Graph databases  

  • Experience with big data tools (Hadoop, Hive or Spark, etc.) 

  • Experience with search systems (Elasticsearch, Lucene, Solr) 

  • MS or PhD degree related to computer science, data science or statistics 

  • Publications in related domain 

  • Experience on projects with deep learning, natural language processing or information retrieval 

  • Experience in Machine Learning 

 What we offer :

  • Open and flexible work environment
  • Possibility to grow in data science and DL/ML engineering
  • Development of own unique AI-driven products that work out-of-the-box and loved by world top companies
  • Great colleagues and open atmosphere at workplace
  • Knowledge and discoveries sharing inside and outside the team
  • Participation in international workshops and conferences
  • ‘Science Friday’ program for self-development
  • Continuous education with invited tutors and paid online programs 

 Employee benefits:

  • 28 days of annual leave

  • Health insurance for you and family members

  • Business travel insurance

  • Employee stock program

  • Paid medical leave

  • 6 days-off in a year

  • Sport activities reimbursement

  • English classes

Подписывайтесь на наш телеграм-канал @remotelist, чтобы всегда быть в курсе новых вакансий! Дайджесты с новыми вакансиями появляются каждые 2-3 часа.

Еженедельная рассылка топ-15 самых просматриваемых вакансий сайта. Письмо приходит каждое воскресенье.