Credit Scoring

The looming credit-gap in a post-Covid world

As the economic effects of Covid19 reverberate around the world, it is clear that there will be an impact on credit scores, and that impact will be felt differently in developed and emerging markets.

“LenddoEFL has always, since inception, focussed on emerging economies. Fundamentally, we take alternative data sets and we use them to build credit-risk models for areas of the markets that don’t have traditional credit bureau scores. And we do this in markets where there are no credit bureaus, or where the bureaus only cover 10 - 30% of the market.

In developed markets, credit-bureaus already exist, generating scores for financial institutions which are used to make credit-risk decisions based on historical repayment behaviour. But there is suddenly an unprecedented spike in unemployment - we’re seeing upwards of 20 million people in the US alone and the numbers are climbing - and that’s the same in many countries.

We’re in an interesting dynamic now. Lockdowns are having a real impact on consumer income and we could be in a place where we see a ‘black-hole’ as it relates to credit-scores.

There is a massive change in the market. People’s lives have changed, very unexpectedly. And, sooner or later, we are going to see a spike in defaults. And at a certain point, this unexpected event will damage traditional credit-risk scores.

When the economy does start to restart, and when the governments do start encouraging banks to lend again, they will go to the credit bureau to ask for credit-scores on borrowers, and they are going to get a lot of challenging results.

At LenddoEFL, we do believe that this pandemic will see a lot of financial institutions push towards new digital solutions. Banks have been encouraging their users for some time to embrace digital solutions, online banking. Well now those branches are closed and people are picking up online banking in a way they haven’t before. And that’s a positive thing.

We think there will now be an increased reliance on using new sources of data; such as social data, mobile phone data, psychometric data etc. These data sets are going to be increasingly important in predicting someone's repayment behaviour following this anomaly of significant default that is coming.

In a developed market, someone who had a perfect credit score might soon have a 3 month gap.

Additionally, in these western markets, where governments are providing relief, there is the possibility they are ‘kicking the can down the road’ and consumers are going to be faced with some type of ‘balloon payment’ in the coming months which could be insurmountable.

In an emerging market, we look at people who have no credit score, and LenddoEFL solutions can generate a score for them.

At LenddoEFL, one of the things we do is, we look at behavioural science. We look at the behaviour of the individual at a snapshot in time - now - and use that behavior to predict the repayment of an individual. We look at thousands of data points from different sources to bring out significant information that historical repayment data doesn’t show.

The pandemic is creating a great opportunity for emerging economies to take a step forward. There is going to be capital available to be deployed. At LenddoEFL, we have seen that in emerging markets there is a greater acceptance of these types of new solutions. Emerging markets have the opportunity to push harder and faster to get out of this.”

Photo by Gabe Pierce on Unsplash

Replicating Psychometric Profiles Through Mobile Phone Data to Assess Credit Risk Abstract 

The big mass of financially underserved individuals across the globe is receiving increasingly considerable attention and led to the development of innovative solutions allowing people to use their digital profiles and personality traits to increase their financial options. On one hand, individuals with little to no credit history are empowered to choose if and when to use their own digital data to access the financial services they need. On the other hand, financial institutions across emerging markets are able to predict risk using non-traditional data sources to maximize approvals, reduce risk and, finally, improve access to financial services. However, not all alternative data sources are obtainable for every market, and historical credit repayment information is not always available to facilitate the training or recalibration of credit risk models fed by a particular data source. 

jens-johnsson-Ae5jmF2PJXg-unsplash (1).jpg

The replication of psychometric profiles through mobile phone data shared by credit applicants enables credit risk assessment through either a psychometric or a mobile profile, alternatively without the constraint of repayment information availability, given the existence of loan performance data collected for any of these data sources for the same market. Using clustering techniques, well defined psychometric profiles are derived for individuals for whom loans were disbursed in Mexico, each associated with different credit risk levels. Afterwards, personality traits associated with these profiles, such as impulsiveness or extroversion, are replicated through phone usage data related to installed mobile applications, calendar events, call logs and phone contacts. Finally, psychometric clusters are rebuilt based on mobile phone traits. Risk sorting power of these traits is validated through loan repayment information available for a different group of credit risk applications in Mexico for whom Android data have been collected. 

In this study, it is shown that psychometric and Android data can be used alternatively to predict risk, based on specific personality traits, extending the value of alternative data for credit risk assessment to market with technological or time information access constraints. The research could open the other to a big set of non-explored solutions to keep improving access to credit reducing process friction and increasing user adoption. 

 

Want a copy of the complete white paper? Please sign up below and we will send you a copy.

Blog | The LenddoEFL Assessment Part 2: Measuring how people answer questions with metadata

By: Jonathan Winkle, Manager of Behavioral Sciences, LenddoEFL

The last post showed how our psychometric content reveals people’s personality traits, but our assessment also captures an abundance of metadata. Metadata is information about how people process the questions and exercises they complete. Here are some examples.

  • How long did an applicant take to answer a question compared to their average response time?

  • How many times did an applicant change their mind and switch their response before submitting their answer?

  • Is the applicant’s information consistent with their written request to the financial institution? (e.g., requested loan amount)

By measuring metadata, LenddoEFL’s approach goes beyond what is possible in traditional credit applications to reveal more information about applicants. Consider the following question from our test:

Which BLUE person is most like you?

image10.jpg

For this question, we consider how long it took the applicant to slide to one answer or another and whether they changed their opinions in the middle. Someone who is confident that they are an organized person should move the slider in only one direction and relatively quickly. Quick, smooth answers belie confidence, whereas slow, wavering responses demonstrate uncertainty.

The relationship between response time and default rate can be complex. Consider another psychometric exercise:

image7.png
image2.png

In this case response time was a non-linear predictor of default, where both slow and fast response times were associated with a greater credit risk!

There are many ways to interpret response time metadata. If an applicant answers a question quickly, are they confident or are they cheating? If they are taking a long time to respond, are they having difficulty understanding the question or putting extra effort into getting their answer right? By collecting metadata across all questions, we can compare a single response time to the applicant’s overall response time distribution to differentiate things like confidence and cheating (see graph below).

An example distribution of response times generated from artificial data

An example distribution of response times generated from artificial data

Conclusion

Metadata reveals another layer of behavior on top of the personality traits we target and can be used to identify features such as confidence, cheating, and confusion. These behavioral traits can be used for predicting default and ensuring that we are collecting high quality data for our models.



Blog | Turning Gini into Profits

Written by Rodrigo Sanabria, Director Partner Success, Latin America

On a prior post by Carlos del Carpio (“The Economics of Credit Scoring”), we discussed the business considerations to assess the merit of a risk model. In this post, I will address how a good origination model impacts the bottom line of a company’s P&L.

These principles may be adapted to look into other types of models used at later stages of a loan life, but on this post we will only address loan origination.

From a business point of view, an origination model is a tool that helps us aim at the “sweet spot”: where we maximize profits. A simple way to think about it is as a trade-off between the cost of acquisition (per loan disbursed) and cost of defaults (provisions, write-offs): The higher the approval rate, the lower the cost of acquisition, but the number of defaults go up.

How do we go about finding the sweet spot? I’ll try to explain it below.

Figure 1

Figure 1

A good model has a good Gini. A “USEFUL” model creates a steep probability of default (also known as PD) curve – we usually refer to it as a “risk split”.

 

Figure 1 shows the performance of a model based on psychometric information used by an MFI. The Gini (not shown in the graphic) is pretty good (0.28). The risk split is great: the people in the lower 20% of the score ranking are about 9 times more likely to default than those in the top 20%.

 

Knowing the probability of default for a given group, we may set a credit policy. Basically, we need to answer: “what would the default look like given an acceptance rate?”

 

Figure 2

Figure 2

 

We have re-plotted the same data in Figure 2, but now we express the probability of default in accumulated terms. Basically, the graph shows that if we were to accept 80% of this population sample, we would have a 4.5% PD, but if we were to accept 40%, the PD would go down 2 points to 2.5%.

Now, from a business point of view, we still do not have enough information to decide. Do we?


 

Where would the profit be maximized?

The total cost of customer acquisition is mainly fixed. Whatever we spend on marketing and sales to attract this population, will not change if we reject more or fewer applicants. So, the cost per loan disbursed would grow as we reduce the acceptance rate.

Of course, the higher the acceptance rate, the larger the portfolio, and the more interest revenue we get. BUT, the higher the provisions and write-offs. The combination of these 2 variables (cost of acquisition and net interest income) produces an inverted U-shaped curve that uncovers the “sweet spot”

Figure 3

Figure 3

The current credit policy is yielding a profit at 100% acceptance rate (see Figure 3) because the sample being analyzed corresponds to all the customers that were accepted (i.e. we have repayment data about them). So, the portfolio is profitable.

But the sweet spot seems to be shy of 60% acceptance rate. If this FI were to cut down its approval rate to that level, profits would increase by about a third, and its return on portfolio value would almost double. Of course, there are other considerations around market share and capital adequacy that may play a role in such a strategic decision, but the opportunity is clearly uncovered by the model.

 

In my experience, the sweet spot usually lies within 30%-70% acceptance rates, driven by marketing expenditures, interest rates, cost of capital, sales channels, and regulation.

What if the shape of the curve shows a continuous positive growth? The sweet spot is at a 100% acceptance rate! – have we reached risk karma? – Most likely, the answer is no (but almost!).

Figure 4

Figure 4

Most likely, we are leaving money on the table. Some business rule may be filtering people before they are scored. I have experienced this situation while working with lenders. For example, a traditional bank was filtering out all SMEs that had been operating for less than X years. This bias in the population was creating a great portfolio from a PD point of view, but there was clearly an opportunity to include younger businesses. As you can see in Figure 4, the maximum return on the portfolio was achieved at 60% approval rate, but they could increase profits by approving beyond the current acceptance rate. Depending on their cost of capital, it may be a good idea to expand the portfolio by approving more people.

In summary, think of your origination model as a business tool. Don’t stop at looking at Gini to assess a model’s merit. Understand how your profitability would be impacted by changes in your acceptance rate. If the PD curve is steep enough, you may capture quite a lot of value by applying the model to either reduce or increase your acceptance rate.

Blog | On the use (and misuse) of Gini Coefficients in Credit Scoring: the Economics of Credit Scoring

This is the fourth part of a series of blog posts about Ginis in Credit Scoring. See also part 1, part 2, part 3.
image4.png

Gini Coefficients and the Economics of Credit Scoring

On a global scale, billions of dollars in debt are granted every year using decisions derived from credit scoring systems. Financial institutions critically depend on these quantitative decision to enable accurate risk assessments for their lending business. In this sense, as with any tool that serves a business purpose, the application of credit scoring is not ultimately measured by its statistical properties, but by its impact in business results: how much can Credit Scoring help to increase the benefit and/or to decrease the cost of the lending business.

Assessing Credit Scoring from a business perspective could sound pretty obvious. However, given the typical compartmentalization of roles that could exist at lending institutions, where Risk and Modeling teams can be completely separated from Commercial departments, it could be easy sometimes to focus too much on the statistical aspects of credit scoring such as Ginis, and forget the ultimate business nature of its purpose. Although there is a clear positive relationship between economic benefits and predictive power, there are also certain elements that can affect the balance between costs and benefits. In this post, we discuss some of these elements and explain their role in the cost-benefit analysis of credit scoring.

 

The benefits of credit scoring

The benefit of credit scoring derives from its ability to accurately identify good customers, and discriminate them from bad customers. The more good customers a model can identify, the greater the interest income that can be generated from a credit portfolio. And the more bad customers it can discriminate, the lower the losses for the credit portfolio. In this sense, the economic benefit of credit scoring can be amplified by two things: the volume of customers, and the size of the credit disbursed to these customers.

Take for example the portfolio of microfinance institution “A” with several thousands of customers but very small loan amounts, and compare it against a smaller microfinance institution “B” providing loans of the same size to a portfolio of just a few hundred customers. Both institutions can see a similar increase of 1% in the predictive power of their credit scoring models, however, the increase in economic benefit yielded from this increase in predictive power will be different just because of the different sizes of portfolio volumes. Everything else being equal, the higher the volume of the portfolio, the higher the potential economic benefit of credit scoring.

The same can be argued for the size of credit disbursed to the customers of a portfolio. For example, take an SME lending institution with just a few thousands of customers but with relatively high credit amounts in the hundreds of thousands of dollars. An increase of 1% in predictive power could bring just a handful of new good clients into the portfolio, or avoid the disbursement of a handful of very bad loans. However a change in just a handful of good or bad clients can be enough to generate a considerable increase of economic benefit in the portfolio given the large size of the loans.

 

The costs of credit scoring

The costs of Credit Scoring can be split in two parts. First, the cost of developing a new model, and secondly, the cost of implementing and maintaining credit scoring models.

If we assume lending institutions are at a stage of technological maturity in which all the necessary data to create a credit scoring model exists and is continuously updated with certain level of quality and integrity, then the first type of cost just depends on the complexity of the modeling process. The whole process of building a model includes data extraction and cleaning, feature engineering, feature selection and the selection of a classification algorithm.

Depending on the lending institution, this process can be handled by a single data scientist (e.g. think of the CRO of a small Fintech startup), or it can be handled by a large department including many different teams with different roles such as data engineers, data scientists and software engineers (e.g. think of a large multinational bank). At the same time, the teams in charge of the model building process can be comprised of junior analysts fresh out of college using well-known standard techniques or include teams of PhDs in computer science doing advanced machine learning. At the end, the cost involved in developing the credit scoring models will depend on how much complexity and sophistication can be afforded and/or needs to be put into the process.

Once the model has been built, it also needs to be implemented and monitored over time. The costs involved are not trivial. Again, they will depend on the stage of technological maturity of the financial institution and the complexity and sophistication required. For example, in some cases the implementation of a credit scoring model can be as simple as creating an Excel calculator loaded with the coefficients of a logistic regressions where some values are manually inputted by a Loan Officer to get a score (e.g. think of a small MFI in the rural area of a developing country). Or it can be as complex as a Python package in a cloud-hosted decision engine integrated in the online platform of a large bank. The handling of big data, software development and testing, as well as the security and legal aspects involved in the deployment of a credit scoring system can considerably increase its costs. And all this, without even considering if the teams that will monitor the performance of the models implemented on a defined frequency basis are dedicated full time, or they are just the same team that also did the modeling and/or deployment.

 

Bottom-line:  The statistical classification accuracy measured by Gini coefficients are indicative of some part of the benefits of using credit scores, but they are not the most important nor the final metric when assessing the cost-benefit of credit scoring. The reason is because the benefits of credit scoring can be influenced by the volumes of customers and the size of the credit. And the costs of credit scoring ultimately depends on the stage of technological maturity of the lending institution, as well as how much complexity and sophistication can be afforded and need to be put in the development, deployment and monitoring of credit scoring models.   

So next time you need to make a decision about using Credit Scores to boost your lending business, ask how much they can help to increase the benefits of the business, and how much they can help to decrease its cost. The final decision will depend on a lot more than just Ginis.

 

At LenddoEFL, we have the expertise to help you boost the benefits and reduce the costs of credit scoring using traditional and alternative data. Contact us for more information here: https://include1billion.com/contact/.

 

Caja Sullana provee a jóvenes emprendedores acceso a crédito en alianza con LenddoEFL, en el marco de proyecto con Fundación CITI y COPEME

20180516_182214.jpg
Citi Logo.png

Lima, Peru, 16 de julio del 2018 – Organizaciones se han unido para financiar una alternativa innovadora de evaluación crediticia para incrementar el acceso a financiamiento a jóvenes que no cuentan con posibilidades de acceder a financiamiento que recién están comenzando un negocio.

Cuatro instituciones se han unido para invertir y avanzar en la innovación liderada por jóvenes en el Perú. Los jóvenes emprendedores que tienen dificultades para acceder a crédito debido a la falta de historial crediticio, ahora pueden solicitar un préstamo de negocios de la institución financiera peruana Caja Sullana usando la evaluación de crédito psicométrica de LenddoEFL, una fintech que potencia las decisiones basadas en datos alternativos para promover la inclusión financiera.

Fundación CITI financia esta iniciativa como parte de sus esfuerzos para impulsar las iniciativas empresariales de los jóvenes en los mercados emergentes. COPEME, una organización peruana que promueve la inclusión financiera, gestiona este proyecto como potencial para expandir esta tecnología a otras instituciones financieras del país.

Caja Sullana, que actualmente usa la evaluación psicométrica de LenddoEFL tanto en agencias como online, buscaba una forma de aprobar más personas con poca información crediticia de manera más sencilla. El proceso previo de evaluación crediticia implica visitas y análisis por los oficiales de crédito que consumen tiempo, que muchas veces tienen como resultado el rechazo del préstamo. Con la evaluación de LenddoEFL, Caja Sullana puede agilizar su proceso de solicitud de crédito, reducir la carga de trabajo para los oficiales de crédito, y tomar decisiones más informadas sobre los solicitantes con poca información.

“Citi Perú está comprometido con el empoderamiento económico de las comunidades donde vivimos y trabajamos, por eso promovemos este programa que fortalece a las microempresas y promueve las micro finanzas. Este proyecto constituye una excelente iniciativa para estimular el uso de tecnologías y para promover la inclusión financiera en las comunidades más alejadas”, señaló Camila Sardi, Head de Asuntos Públicos de Citibank del Perú.

"Dos de nuestros ejes estratégicos son el apoyo a la inclusión financiera de más peruanas y peruanos, en particular de zonas rurales y peri-urbanas, y la implementación de soluciones innovadoras que mejoren la eficiencia de las instituciones de micro finanzas: En ese sentido, el proyecto ejecutado con el apoyo de Fundación Citi, se suma a las acciones que en el marco de estos dos ejes desarrollamos en el país, habiendo encontrado en Caja Sullana y LenddoEFL, dos organizaciones cuyo alcance, experiencia y objetivos facilitan la consecución del propósito de su diseño y puesta en marcha: la incorporación al sistema financiero de jóvenes emprendedores a través del empleo de una herramienta disruptiva que estamos seguros tendrá un impacto significativo." afirma Carlos Ríos Henckell, Gerente General de COPEME.

“Tenemos como objetivo atender a los segmentos más jóvenes y ofrecerles esta nueva opción para ingresar al sistema financiero, considerando su perfil como emprendedores en potencia. La falta de historial crediticio dificulta el acceso a herramientas de desarrollo, por lo que nos esforzamos en promover la inclusión financiera y ser el soporte económico que ellos  necesitan”, expresó el presidente del Directorio de Caja Sullana, Joel Siancas Ramírez.

Además, agregó “nuestro sentir como institución siempre ha sido acompañar a los ‘peruanos guerreros’ en el crecimiento de sus proyectos y ser parte importante en la historia de su éxito”.

“Trabajamos con algunas de las mayores instituciones financieras de Perú y América Latina, esta es una oportunidad única de servir a la inclusión de jóvenes emprendedores. La evaluación de LenddoEFL ofrece una forma poderosa de incluir a más personas en el sistema financiero, y estamos entusiasmados de asociarnos con COPEME, Fundación Citi y Caja Sullana para servir mejor a los jóvenes emprendedores de todo el país”, señaló Rodrigo Sanabria, Director Partner Success, América Latina, LenddoEFL.

Acerca de Citi
Citi, el banco líder global, tiene aproximadamente 200 millones de cuentas de clientes y realiza negocios en más de 160 países y jurisdicciones. En el Perú, Citi ofrece a corporaciones, gobiernos e instituciones una amplia gama de productos y servicios financieros, incluyendo servicios bancarios y de crédito, servicios bancarios corporativos y de inversión, corretaje de valores, servicios de transacción y administración patrimonial. Por información adicional, visite: www.citigroup.com 

Acerca de COPEME
Somos una organización que desarrolla actividades y provee servicios para el fortalecimiento del sector microfinanzas, el desarrollo de la Mype, y el fomento de la inclusión financiera. Trabaja en Perú desde 1991, alcanzando sus acciones a microfinancieras de todo el país, empresas privadas, organismos públicos, proveedores de fondos, inversionistas y otros actores relacionados al segmento Mype y de microfinanzas. http://www.copeme.org.pe/

Acerca de Caja Sullana
Somos la Caja Municipal de los emprendedores con norte, tenemos ya más de 30 años en el Sistema Financiero regulados por la Superintendencia Nacional de Banca y Seguros. Actuamos bajo la forma de Sociedad Anónima, con el objetivo de captar recursos y utilizarlos para brindar diferentes servicios financieros, preferentemente a las pequeñas y micro empresas, contribuyendo así al desarrollo económico en las diferentes regiones donde operamos, siempre comprometidos en ofrecer estos servicios con alto sentido de Responsabilidad y Calidad. Más información sobre nosotros o nuestros servicios: http://www.cajasullana.pe.

Acerca de LenddoEFL
Nuestra misión es proveer a mil millones de personas acceso a poderosos productos financieros a un menor costo, más rápido y conveniente. Usamos Inteligencia Artificial y Análisis Avanzado para traer las mejores fuentes de digital y psicometría para ayudar a las instituciones financieras en países en desarrollo para atender en confianza a las personas que no están bancarizadas y pequeños negocios. A la fecha, LenddoEFL ha proporcionado productos como puntajes crediticios, verificación e Insights a más de 50 instituciones financieras, ayudando a siete millones de personas e impulsando el préstamo de dos mil millones de USD. Para mayor información, visite https://include1billion.com/.

Blog | On the use (and misuse) of Gini Coefficients in Credit Scoring

image1.png

Over years of blogging, one of our most popular ever blog posts was about the Gini coefficient. In this series of posts, we revisit the Gini and dig further into its uses and the ways we see it misused in credit scoring.

What is a GINI?

For lenders around the world, the “Gini Coefficient” is an often heard, sometimes feared, and frequently misunderstood statistical measure. Commonly used to assess things like wealth inequality, Gini Coefficients are also used to evaluate the predictive power of credit scoring models. In other words, a Gini Coefficient can help measure how good a credit score is at predicting who will repay and who will default on a loan: the better a credit score, the better it should be at giving lower scores to riskier applicants, and higher scores to safer applicants.

Though calculating a Gini Coefficient is complex, understanding it is fairly simple:

A Gini Coefficient is merely a scale of predictive power from 0 to 1, and a higher Gini means more predictive power.

However, there are a few key aspects of Gini Coefficients that are not always well understood and can lead to their misuse and wrong interpretation. Over this series of blog posts we’ll discuss four of them:

  1. People often compare Ginis when they should not. The only useful comparison across Ginis (or AUCs) is when looking at different scores over the exact same data. 

  2. People forget that Gini will vary by acceptance rate. When presented with a Gini coefficient, always keep an eye on the effect of the acceptance rate.

  3. People focus on Ginis, but are not always aware of its impact on the costs, benefits and overall economics of Credit Scoring.

  4. People do not fully understand and often overestimate the role of Gini in the business of lending.

 

About the Author:

Carlos del Carpio is Director of Risk & Analytics at LenddoEFL. He has 10+ years of experience developing credit scoring models and implementing end-to-end credit risk solutions for Banks, Retailers, and Microfinance Institutions across 27+ countries in Latin America, Asia and Africa.

About LenddoEFL

LenddoEFL’s mission is to provide one billion people access to powerful financial products at a lower cost, faster and more conveniently. We use AI and advanced analytics to bring together the best sources of digital and behavioural data to help lenders in emerging markets make data-driven decisions and confidently serve underbanked people and small businesses. To date, LenddoEFL has provided credit scoring, verification and insights products to over 50 financial institutions, serving seven million people and lending two billion USD. For inquiries about our products or services please contact us here.

Blog | Raising the Stakes on Psychometric Credit Scoring

An updated and expanded 2nd edition (first edition)

Why read this post?

Learn why high-stakes data is essential for building accurate credit-scoring models.

 

Introduction

Billions of people lack traditional credit histories, but every single person on the planet has attitudes, beliefs, and behaviors that can be used to predict creditworthiness. Quantifying these human traits is the focus of psychometrics, and the alternative data provided by this technique allows LenddoEFL to greatly expand financial inclusion in its mission to #include1billion.

But there is a catch: in order to build models that accurately predict default, applicants need to complete psychometric assessments in pursuit of actual financial products, a so-called “high-stakes” environment. This is because people answer psychometric questions differently when they have a chance to receive a loan (the high stakes) than they would in a hypothetical situation with no incentive (the low stakes).

Despite this fact, psychometric tools are frequently built using low-stakes data. For example, many companies develop psychometric credit scoring tools using volunteers. And many lenders want to validate psychometric credit scoring tools on their clients through back-testing: giving the application to existing clients and comparing scores to their repayment history, again a low-stakes setting.


These approaches are only valid if low-stakes data can be applied to the real world of high-stakes implementation, where access to finance is on the line for applicants. But it turns out that this is not the case. A recent study published by our co-founder Bailey Klinger and academic researchers proved that low-stakes testing has no predictive validity for building and validating psychometric credit scoring models in a real-world, high-stakes situation. The data below shows exactly how applicant responses shift as they move from one environment to another.

 

The Experiment

To test for differences between low- and high-stakes situations, LenddoEFL gathered psychometric data from two sets of micro-enterprise owners in the same east-African country. One group already had their loans (low-stakes) and another group completed a psychometric assessment as a part of the loan application process (high-stakes).

First, the low-stakes data. The figure below shows the frequency distribution for two of the most important ‘Big 5’ personality dimensions for entrepreneurs, Extraversion and Conscientiousness, as well as a leading integrity assessment[i].
 

image1.png


You can see that when the stakes are high, people are answering the same questions very differently. The distribution of scores on these three personality measures shifts significantly to the right. When something important is at stake, like being accepted or rejected for a loan, people answer differently.

How do these differences in low- vs. high-stakes data matter for credit scoring?

To see how these differences impact the predictive value of psychometric credit scoring, we can make two models[ii] to predict default: one uses responses from applicants that took the application in low stakes settings, and the other uses responses from applicants that were in high stakes settings. Then we can use a Gini Coefficient—which measures the ability of a model to successfully rank-order applicants’ riskiness and for which a higher coefficient is a metric of success in this—to compare each model’s ability to predict default for the opposing population as well as its own.[iii]

image2.png


These results clearly show that there is a significant change in the rank ordering when models built on low-stakes data are applied in high-stakes settings and vice versa.[iv] Importantly, we can see that a psychometric credit-scoring model can indeed achieve reasonable predictive power in a real-world, high-stakes setting. But, that is only when the model was built with high-stakes data.

Think about it like this: when the stakes are high, both less and more risky applicants change their answers. But, less risky applicants change their answers in a different way than riskier applicants. This difference is what is used to predict risk in psychometric credit scoring models: the difference between how low- and high-risk people answer in a high-stakes setting.

This also illustrates why we see that a model built on low-stakes data is ineffective in a real-world high-stakes implementation. In the low-stakes setting, the low- and high-risk people aren’t trying to change their answers, because they aren’t concerned with the outcome of the test. Once the stakes are high, however, this pattern changes.

 

Conclusions

Testing existing loan clients or volunteers has an obvious attraction: speed. That way you don’t have to bother new loan applicants with additional questions, and then wait for them to either repay or default on their loans before you have the data to make or validate a score, an approach that takes years.

Unfortunately, these results clearly show that this shortcut does not work. People change their answers when the stakes are high, so a model built on low-stakes data falls apart when used in the real-world. People answer optional surveys with less attention and less strategy than they do a high-stakes application, and therefore the only strong foundation to a predictive credit-scoring model is real high-stakes application data and subsequent loan repayment.

Consider an analogy: you can’t predict who is a good driver based on how they play a driving video game, where the outcome is not important. Conversely, someone who does well on a real-world driving test may not perform that well on a video game.  Whether it is driving skills or creditworthiness, you must predict the high-stakes context with high-stakes data.

 

TAKEAWAYS:

- Psychometric model accuracy is only guaranteed when you collect data in a high-stakes situation (i.e., a real loan application).

- Despite its speed, back-testing a model on existing clients in a low-stakes setting is risky because it might not tell you anything about how the model will work in a real implementation.

- If you want to buy a model from a provider, the first thing you should verify is what kind of data they used to make their model. Was it from a real-world high-stakes implementation similar to your own?

 


[i] These are indices from widely available commercial psychometrics providers. It is important to note that LenddoEFL no longer uses any of these assessments or dimensions in our assessment, nor any index measures of personality.

[ii] Stepwise logistic regression built on a random 80% of data, and tested on the remaining 20% hold-out sample. An equivalently-sized random sample was used from the other set (high-stakes data for the low-stakes model, and low-stake data for the high-stakes model) to remove any effects of sample size on gini.

[iii] Note that this exercise was restricted to those questions that were present in both the low- and high-stakes testing. It does not represent LenddoEFL’s full set of content and level of predictive power, it is only for purposes of comparing relative predictive power.

[iv] The results also show that using standard personality items, the absolute predictive power is lower in a high-stakes setting compared to a low-stakes setting. This is likely because of the ability to manipulate some items in a high-stakes setting makes them not useful within a high-stakes setting. This lesson has lead LenddoEFL to develop a large set of application content that is more resistant to manipulation and which has much higher predictive power in high-stakes models. This content forms the backbone of the current LenddoEFL psychometric assessment, all of which is built and tested exclusively with high-stakes data and subsequent loan repayment-default rather than back-testing.

 

Sina News Taiwan | How to break the credit assessment problem? (如何破解信貸評估難題?)

FC_Fin_Inclusion_1024x1024.jpg

Bangladeshi banker and Nobel laureate Muhammad Yunus (Muhammad Yunus) the promotion of microfinance , is the poor through microcredit loans , so there is money to do a small business to support themselves, and thus get rid of poverty. However, due to the time-consuming and laborious credit evaluation of lenders, the large-scale application of microfinance is difficult to achieve once.

Nowadays, mobile banking comes. It can collect data to help people who have little formal financial records in the traditional sense to broaden their services. Labor costs are also greatly reduced. For example, Kenyan mobile telecommunications operator Safaricom and African Commercial Bank jointly launched the M-Shwari business in 2012, which can determine customers’ credit scores based on Safaricom’s user information and the trading history of its M-PESA mobile money business. Loan amount.

In addition to payment data, mobile phones (especially smart phones) can also provide more types of information for credit evaluation by borrowers . For example, a person's geographic location data can reflect whether he has a stable job and fixed residence; shopping records can even reveal whether the borrower is pregnant ; and the richness of information obtained by social media is not Yu.

The fintech start-up company Lenddo EFL also uses the Internet to conduct psychological tests on potential borrowers. The question concerns the concept of money (for example, choosing to pay $10,000 at a time, or $20,000 for six months), where your money is spent. , Evaluation of living communities, etc., to determine the reliability of testers loan repayment. To date, the company has completed more than 7 million credit assessments, helping consumers with a lack of traditional credit records to borrow 2 billion U.S. dollars from 50 financial institutions of varying sizes.



詳全文 如何破解信貸評估難題?-財經新聞-新浪新聞中心 http://news.sina.com.tw/article/20180514/26854022.html

Forbes | Could Personality Tests One Day Replace Credit Scores?

forbes article 042018.jpg

If someone gave you an unexpected $100, what would you do with it? Give it to charity? Save it? Splurge on something fun?

We see questions like this in personality quizzes online, and sometimes even when applying for jobs. Your answers are supposed to help others predict your behavior using what’s called psychometrics.

And companies looking to avoid hiring potential problem employees aren’t the only institutions interested in psychometrics. The financial industry might get in on it, too.

What if, instead of a lender checking your credit score, they gave you a personality test?

Read full article.

AstroWani | CTOS, LenddoEFL extends financial inclusion in Malaysia

30% of Malaysians with good potential is still denied access to loans. This is because they lack or directly have no credit history. In order to curb this issue, Malaysia's Largest Credit Reporting agency, CTOS Data Systems Limited, partne…

30% of Malaysians with good potential is still denied access to loans. This is because they lack or directly have no credit history. In order to curb this issue, Malaysia's Largest Credit Reporting agency, CTOS Data Systems Limited, partnered with Fintech LenddoEFL company and emerged with a new solution.

New Strait Times | CTOS & LenddoEFL partner to boost financial inclusion in Malaysia

KUALA LUMPUR: CTOS Data Systems Sdn Bhd (CTOS), Malaysia’s largest credit reporting agency, has entered into a partnership with LenddoEFL to enable access to financing for Malaysian consumers with little to no credit history.

Both CTOS and LenddoEFL have aided banks, lending institutions, utility and credit card companies to reduce risk, increase portfolio size, improve customer service and accurately verify applicants. Read full article.

Markets Insider | CTOS & LenddoEFL Partner to Boost Financial Inclusion in Malaysia

KUALA LUMPUR, Malaysia, and SINGAPORE, CTOS Data Systems Sdn Bhd (CTOS), Malaysia's largest credit reporting agency, has entered into a partnership with LenddoEFL to achieve a joint vision of financial inclusion for Malaysian consumers with little to no credit history. Both fintech leaders have aided banks, lending institutions, utility and credit card companies to reduce risk, increase portfolio size, improve customer service and accurately verify applicants. Read full article.

LenddoEFL and Orient Commercial Bank join together to serve the unbanked of Vietnam

How email and smartphone data help you get a loan

What your phone habits reveal about you

SoFi is preparing to launch in Sydney, its first market outside of the US, and earlier this year the country's first loans and deposits marketplace, Lodex, formed a partnership with Singaporean start-up Lenddo to bring its social scoring technology to the country...



Read more: http://www.afr.com/technology/how-email-and-smartphone-data-could-help-you-get-a-loan-20171212-h02zi0#ixzz534zFfQmg 

Three ways alternative data will become more mainstream in 2018

Overseas lenders can develop credit scores based on mobile and web data
FICO uses LenddoEFL’s credit scoring model overseas, which includes email, mobile and web data to assess of thin-credit file consumers in overseas markets. This is the technology behind FICO’s recently launched X Data Score in India, which generates a score based on a consumer’s mobile and digital footprint, including email data. FICO also has a traditional score in India as well.