Author: Shashank Raj (Business Analyst) and Kavita Yadav (HR-SME)

HRM teams put constant efforts to improve their hiring process to bring in the best talent into the organization. Even when hiring managers focus on behavioural and cultural-fit aspects of any candidate along with impressive experience and skill sets, many times the HR teams are unable to evaluate the long-term success of a future candidate, leading to high voluntary attrition.

Problem Statement:  

Organisations invest significant resources in hiring & training new employees, along with running training programs for their existing employees. All of this is done with presumption of improving employee productivity, with a significant gestation period. High voluntary attrition can be detrimental to both the organisation’s growth as well as the existing employees’ morale, business continuity and contributes to a significant impact on the bottom line.   

Business Need:

The key objective of the solution is to come up with:

  • A classification model to predict the chances of an employee leaving the organisation which can be used by the HRM team to know the requirement of resources beforehand and at the same time improve the policies for their employees
  • Create intervention strategies based on employee segments

Proposed Solution:

The HRM team needs to identify employees at high risk of attrition & thereby creating timely intervention to prevent voluntary attrition. Employees would be scored on a monthly basis to understand propensity of attrition in the coming month, thereby giving an advance signal to HRM & Managerial Teams with time to intervene.

Example:- Let’s assume that training of new employee costs 2000$ and if we can predict which employee is going to leave next month, and propose him/her a bonus program worth 500$ to keep him for next 6 months, we can keep experienced, well-trained employee under the hood, with higher morale.

This frequently updated ML Score will be a significant tool in combating attrition, as companies can design retention strategies accordingly, with direct impact on the bottom line. 

Indicative attributes: The more exhaustive the attributes are, the more accurate our model is in classifying the employees:

  • Age
  • Business travel frequency
  • Daily rate
  • Department: Sales, Research & Development, Human Resources, Marketing etc.
  • Distance from home
  • Educational qualification
  • Employee count
  • Environment satisfaction
  • Gender: Male, Female
  • Hourly rate
  • Job involvement (Feedback)
  • Job level & Role
  • Job satisfaction from employee surveys
  • Marital status
  • Monthly income
  • Monthly rate
  • Number of companies worked
  • Appraisal hike
  • Performance rating & Percentile
  • Standard hours: True or False
  • Stock option level if Applicable
  • Total working years
  • Work life balance survey
  • Years at company
  • Years in current role
  • Years since last promotion
  • Years with current manager
  • Market salary benchmarking 

Our Analytical Approach:

Key Steps Involved:

  • Standardize the data provided by the client
  • Perform statistical analysis to study impact on attrition
  • Use the information gained from the above analysis to create a Machine Learning model to predict attrition rate
  • Integrate model with data stream for monthly run

Building Machine Learning (ML) Model:

  1. Gathering the data: We had the data of employees for the last 4 years. The data contained basic details such as age, gender, educational qualification, address, place of residence, and the professional details such as date of joining, experience, skills, projects worked, designation,year end reviews, reasons for resignation (in case of ex-employees) etc.
  2. We ran association analysis on the historical data, to check for association between the variables. 
  3. Building the model: Tool – Python
  • Divide the dataset into train and test data. We used 70% data to train the model and it is tested with 30% of the data
  • Classification algorithms used were Random Forest, Decision Tree, Gradient Boosting
  • The models were validated using the test data for accuracy and then the champion model was selected

The model scores the existing employees on a scale of 0 and 1. 0 indicates that the employee is least likely to leave the company and 1 indicates that employee is most likely to leave the organisation.


HR Analytics is no longer a luxury for organisations, it is now to be seen as an essential ingredient for success. ML will play a crucial role in the evolving path of HR Teams of every organisation.

The attrition prediction model will lead to an overall profit or savings in the Human Resource Management process which includes hiring of new employees, retention, amount spent on the training and development of new employees. Attrition causes loss to the project due to loss of valuable employees. It will prescribe the organisation on the further actions to be taken. Thus, organisations can keep track of the amount of budget it has spent on human resource management and budget to be spent on future and take necessary actions. Through these data driven models they can forge long term engagement with the employees.

I agree to have my personal information transfered to MailChimp ( more information )
Join over 3.000 like minded AI enthusiasts who are receiving our weekly newsletters talking about the latest development in AI, Machine Learning and other Automation Technologies
We hate spam. Your email address will not be sold or shared with anyone else.

Leave a Reply