Forecasting life expectancy through machine learning model

Estimator of the Life Expectancy (WEB APP)

Get started

Image Image

Can we correctly predict life expectancy?

And, if so, which features are most relevant for prediction?


Context / Motivation

We are curious analyze the life expectancy, because it is a key metric for assessing population health.

Shifts in life expectancy are often used to describe trends in mortality

Report / Insights

This project leverages the World Data Bank’s Development Indicators dataset, which contains development indicators for 214 countries and territories, along with 33 aggregate country groupings (e.g. world, regions, income levels).

Code / Repository

Using our group’s diverse field of specialisations, we have sought to collate a more rounded database(1999-2019) in which we built and deployed an interactive machine learning model(1999-2019) to predict life expectancy based in 19 key indicators.


OUR WORK_FLOW

With 1,345 indicators across 20 categories for 60 years, the World Bank Data Bank is an invaluable resource to help us explore insigns about the life expectancy.

Image

01

THE DATA

While the orginal datasets are fairly clean, there are missing data for several years for some metrics, and large gaps for specific counties. The dataset used for this project is a slightly transformed version of the raw files available here to facilitate analytics.


Furthermore, using this updated dataset, we will create a machine learning model in an attempt to forecast a more accurate prediction of life expectancy based on these indicators.

02

MACHINE LEARNING



             (2 MODELS)



We used Linear Regression as exploratory starting point to analyses the original data (1960-2019). The idea is infer causal relationships between the independent(life expectancy) and dependent variables( mostly economic indicators).

As the data consisted of hundreds of indicators, we filtered down the data by identifying 19 key indicators that we felt were most applicable, as well as the range of years we wanted the data to represent(1999-2019). For this data set we apply Gradient Boosting Regressor.

03

INSIGHTS/APP


Using Tableau and D3.js / Plotly we create insightful and impactful visualizations in an interactive and colorful way to extend the analysis of the life expectancy.


Finally we build and deploy a interacting machine learning Model(Gradient boosting Regressor) to predict life expectancy based in the last 20 years of the dataset (1999-2019) and ours 19 key indicators.


OUR_ANALYSIS

We used Linear Regression as exploratory starting point to analyses the original data (1960-2019).

Then we filtered down the data by identifying 19 key indicators that we felt were most applicable, as well as the range of years we wanted the data to represent(1999-2019).

For this data set we apply Gradient Boosting Regressor, which is the core for our application.

Image

GENERAL ANALYSIS



Inferential statistics & analysis

First, let’s explore the commonly-assumed relationship between life expectancy and economic prosperity. Using the income level classifications in the data, we plot the life expectancy trends from 1960 to 2019 for the high, middle, and low income, as well as the life expectancy trends for the world.

EXPLORE!

Image

SPECIFIC ANALYSIS


Gradient Boosting Regressor

Machine learning technique for regression, classification and other tasks, which produces a prediction model in the form of an ensemble of weak prediction models, typically decision trees.

In this case we have sought to collate a more rounded dataset(from 1999-2019) which considers 19 social, economic, mortality and health-related indicators in which we considered to be essential for the calculation of life expectancy

INSPECT!

Image




THE TEAM

Gradient Boosting make the function fit very flexible!

I manage the supply chain for a BioTech Pharmaceutical Company to get medical devices and drugs in and out of Australia and to various Hospital’s Critical Care Units across the country.

Image Arron, — Data Mining /ETL /Phython Analysis / ML /Streamlit

Streamlit is so easy to use!

Economist curious for technology. I Love to use a data-driven, research-based aproach to tell stories from datasets. Skilled in time management and organizational capabilities that excels ​working with teams.

Image Renzo, — Data Colletion /ETL / AWS(S3)/ Phython Analysis / Web Designer /ML/Streamlit

Machine learning really works!

I am a Secondary Economics Teacher who believes a logical economic mindset can develop a deeper and better understanding of this world. I have the drive to make a positive impact on people's lives and strive for the best outcomes, regardless of the level of the challenge.

Image Carmen, — Tableau / Data visualization / ETL /Phython Analysis

I can do a PHD in D3.js!

I am a graduate in Applied Mathematics, currently undertaking a Data Analytics certificate through Monash University.I have undertaken many applied statistics units as part of my degree.

Image Callum, — D3.js / Plotly / Tracing / Evaluation

I love Tableau!

Tech enthusiast with a curiosity and passion for data and code, with a background in business and the services industries.

Image Tealiie Le, — Tableau / Data visualization / Documentation

I enjoy to work with the world bank datasets!

As an International Relations and Journalism graduate, My vision is to analyse, visualise and present data and bring relevant stakeholders together, to create an environment for better evidenced based decision making.

Image Nicklaus, — D3.js / Documentation / Tracing / Story telling


Don't be shy,
Check out our app!!