apprentice data scientist
From September 2022 to September 2023, I did an apprenticeship as a member of the Innovation Desk team at the Scoring Center in the filial Risk of BPN Paribas Personal Finance located in Merignac. Through this year of apprenticeship, I wanted to increase my skills in Data Science, discover new techniques and learn more about the working environment of a big company.

Member of the innovation desk team of PF RISK at the Scoring Center
Research on different categorical variable encoding techniques for Machine Learning models
Development of credit card granting scorecard for an entity in South Africa (logistic regression)
Development of classification models in Machine Learning
Data analysis and management
details
The aim of this apprenticeship was, in a first time, to assist the data scientists of the team by doing a research work about categorical encoders. And then, having a more business-oriented mission by building a scorecard using logistic regression to identify “bad debt” individuals on credit cards for a filial in South Africa.
For the first project, the data preprocessing has a very important influence when it comes to the Machine Learning results. Most of the models can’t handle categorical variables (like RandomForest models which are mainly used at the ScoringCenter), except if we change them into numerical values. The model performances will rely on how we will encode those data. Depending on the method used, the predictions can be highly impacted. And so, the precision might be increased or decreased. Through this research work I learned new ways of encoding techniques (and their impacts) that can be usefull for data scientists.
For the second project, the Scoring Center teams often use Logistical Regression models. The aim of these models is to attribute a “score” based on the input information of a client. It is a grade based on the values of his explanatory variables (for example: age, Socio-Professional Category, income). Through this porject I learned how to build and present a scorecard model. Working on a real business project with a real business goal was very nice. I also learned a lot on the methodology of a business case and how to document your work.
check out my report