How to quickly find and not lose AI and Data Science specialists

In collaboration with Anna Perova


Introduction


Every day, humanity creates, uses and stores huge amounts of data. Every article, post in a blog or instagram, every Like and indeed every fact of communication - data that, when processed, becomes valuable, brings profit and warns against the risks of the one who owns them and knows how to extract the relevant information.


With the growth of data analysis and awareness of the usefulness of existing archives, the need for experts in Data Science, machine learning and artificial intelligence (AI), able to work with the data and create on their basis useful models, as well as self-processing systems and forcing them, increases. work.


Why is it necessary for those who are recruiting teams in this area to think about new recruiting methods?


As back in 2015, they wrote on TechCrunch, according to Mckinsey , who, admittedly, were not far from the truth, 490,000 specialists will be required in this area by 2018.


If you rely on LinkedIn data - out of 236 million profiles, about 11,400-19,400 are Data Scientists profiles.


Already, Amazon's average annual investment in AI Hiring is $ 227.8 million , while Google's key competitor investment in AI hiring is $ 130.1 million . Experts in the field of artificial intelligence leading companies receive from $ 100,000 to $ 500,000 per year. This is evidenced by data from the survey, which conducted The New York Times, and in principle is checked periodically comes across either on dice.com, or on monster.com, or on LinkedIn.


The area is new and in trend. The quantity and quality of young specialists does not satisfy the highest need for them in the whole world, as well as here in Russia - here the situation differs only by the order of salaries and by the number of open vacancies in the field of Data Science & AI.


According to the analysis of hh.ru, the number of open vacancies in the field of Machine Learning, Deep Learning, Data Science: more than 1000. The number of ready-made specialists with the necessary experience is not more than 300. Candidates with at least minimal experience in this area of ​​AI, Data Science are not suitable for these positions are about 3 thousand. And this in itself is a problem to search and hire because:



All this leads to an extremely overheated labor market, and hiring in this area must take into account a number of factors:



Due to the high competition for talents in this area, a number of selection questions arise, the main of which are:



1. Where to find?


In addition to the standard and well-known sources, I would like to draw attention to the most productive in terms of my personal experience in hiring AI & Data Science specialists.



Useful site for searching candidates by citation index: eLIBRARY.ru.
This site contains publications of Russian scientists. There are more than 24 million articles posted, the database is constantly updated.
One of the main lafhacks is to register on the site, then find a professor with a large number of publications with a high level of citation, find a way to contact him and ask for recommendations from co-authors and students. As an option - open publications and connect with collaborators through available social networks.
When hiring scientists, it is important to bear in mind that they may lack practical skills, an understanding of business, but perhaps their scientific career may be useful for the development of high-tech projects, including in the field of AI.



2. How to select really good Data Science & AI specialists


For HR, it is not easy to understand all the concepts at once, so the most important thing is to understand the main headings well in order to at least somehow navigate. And to act in accordance with the instruction (chapter “FINAL LIST, or the Principles of Personnel Selection”) - i.e. very clearly balance the complexity of work and trials with financial and non-financial motivation.


So, for a start, it is important to determine what is now understood by Data Scientist


Data Scientists use statistical data, machine learning and analytical approaches to solving critical business problems. Their main function is to help organizations transform their volumes of big data into valuable and efficient models.


They should know mathematics well, program, develop machine learning algorithms for automation of algorithms. They are also expected to have a high ability to interpret data, it is important to be able to visualize them, problem-solving skills are important, even if the problems are not fully worded.


It is important that they can work with different types of data and data of different levels of readiness.


A good mathematical background (knowledge of linear algebra, analytic geometry, probability theory, and mathematical statistics) is a must. And this is even more important for data analysis than engineering knowledge. Learning ML models requires an understanding of which particular models need to be used, how to interpret, and how to improve the results.


Knowledge of programming languages : Python or R (but to navigate the technological stack you used); C / C ++; Java
Skills : Scala, Apache Spark, Hadoop, machine learning, deep learning, and statistics.
Optional : Tensorflow, PyTorch, Keras, Caffe, Pandas etc., Jupyter, and RStudio., Experience with highly loaded systems, Cuda.


The difference between Data Scientists and Data Engineer is the ability to not only analyze data, but also integrate them into existing systems. In this connection, deep knowledge of programming languages ​​is particularly important, as well as the experience of creating or participating in the creation of highly loaded, multi-threaded systems, etc.


Key concepts with which it is desirable to be familiar to a recruiter: Machine Learning, Deep Learning, Data Science, Data Mining, Computer Data Recognition, Car Recognition, Face Recognition, Health Systems, Natural Language Processing, kaggle contests.


Filtering candidates based on telephone HR interviews:


  1. It is important to understand how deep the candidate’s knowledge is in mathematics (linear algebra, probability theory)
  2. What frameworks does it use? A diverse experience is welcome.
  3. What are the most complex projects you had to create? What was the personal role and result?
  4. What competitions did you take part in?
  5. Are there articles in scientific journals here at habr.com?

Algorithm of recruiting and selection of candidates:


  1. The technical interview consists of 3 parts:
    • Online testing for 20 minutes. An example of a site for placing an online test. ;
    • Testing - 1 hour. Technical interview in the office. Test task 20 min-1 hour. You can create a test of 10-15 tasks (problems in probability theory, mathematical statistics, computer vision, machine learning). The test is performed by the candidate alone in the meeting room. He does not have to solve all the problems, but it is important to solve at least 50%. In testing it is useful to put points for an objective assessment and the ability to compare candidates;
    • The oral part of the technical interview is 1 hour (discussion of the results of problems in probability theory, mathematical statistics, and analysis of how the candidate is related to solving computer visual problems, machine learning).

At the same time, it is necessary to understand that the working conditions and other “buns” are known to the candidate and honestly announced in advance, otherwise not everyone will have the motivation to take the test.


  1. HR & Personality Interview with Team Leader
    Personality traits that are necessary for DataScientist:
    • High learning It should be smart, quickly acquire new skills, be prepared and constantly develop in their field and preferably in the subject area of ​​the company.
    • Curiosity, interest in new technologies, practical experience of their use, interest in related areas.
    • Agility and perseverance - the ability for a long time to work on one problem
    • Creativity - interest in new opportunities, motivation and the ability to come up with new solutions.

How to keep AI & Data Science specialists in the company:


Here, the standard retention tools have their own characteristics.



And human values:



In conclusion, it is worth noting that it is important to know that the difference between these vacancies from the rest - the previous recruitment methods for these candidates do not work so effectively. It is important to keep a balance between the extreme shortage of specialists, the willingness to be more flexible in conditions and the need to filter and select strong professionals who can make a positive contribution to changes in business.

Source: https://habr.com/ru/post/412625/


All Articles