7 Data Science Project Ideas for Aspiring Data Scientists

栏目: IT技术 · 发布时间: 4年前

7 Data Science Project Ideas for Aspiring Data Scientists

A beginner-friendly list of data science projects for May 2020

May 5 ·4min read

7 Data Science Project Ideas for Aspiring Data Scientists

Photo by Max Duzij on Unsplash

Due to popular demand and many requests, I decided to create a unique list of data science projects for those that are beginning their journey as a Data Scientist. There’s a mix of visualization projects, exploratory data analysis projects, and predictive modeling. I hope you enjoy this article and wish you the best of luck in your endeavors!

Rainfall in India

Project type: Visualization

Link to dataset here .

7 Data Science Project Ideas for Aspiring Data Scientists

Photo by Julian Yu on Unsplash

This dataset contains monthly rainfall details of 36 sub-divisions of India. Here are some visualization ideas you can try for yourself:

  • You can create bar graphs or pie graphs to compare the amount of rainfall by region
  • You can create a line graph to compare rainfall by region over time
  • You can create an animated choropleth map to show where it rains over time! If you want to learn how to build a choropleth visualization, check out my tutorial here .

Global Suicide Rates

Project type: Exploratory Data Analysis

Link to dataset here .

7 Data Science Project Ideas for Aspiring Data Scientists

Photo by Sasha Freemind on Unsplash

This is a consolidated dataset with details on suicide rates, human development index (HDI) numbers, GDP, and demographics by country by year. The purpose of this dataset was to see if there are any indicators that are correlated to increased suicide rates.

Explore the data and see what countries and continents have the highest suicide rates. What trends do you notice? Are suicide rates increasing or decreasing overall? What is the proportion of the number of suicides between males and females? See if you can find any variables that are correlated with suicide rates.

Summer Olympic Medals

Project type: Exploratory Data Analysis

Link to dataset here .

7 Data Science Project Ideas for Aspiring Data Scientists

Photo by Bryan Turner on Unsplash

On a less morbid note, here’s a dataset that contains all of the medal winners in the Summer Olympics from 1976 Montreal to 2008 Beijing. Explore the data and see which countries have won the most medals overall. Are there countries that are performing better over time? What about worse over time?

World Happiness Report

Project type: Exploratory Data Analysis

Link to dataset here .

7 Data Science Project Ideas for Aspiring Data Scientists

Photo by KAL VISUALS on Unsplash

The happiness score is a quantifiable measurement of the average ‘happiness’ of a country. This is based on six factors: economic production, social support, life expectancy, freedom, absence of corruption, and generosity.

This dataset contains 155 countries and their associated happiness scores and 6 factors from 2015 to 2019. Are we globally becoming more or less happier each year? What continent is the happiest? The least happy? Which of the six factors has the biggest impact on happiness? What about the least impact?

Pollution in the United States

Project type: Visualization

Link to dataset here .

7 Data Science Project Ideas for Aspiring Data Scientists

Photo by Ella Ivanescu on Unsplash

This dataset contains information on the four major pollutants (Nitrogen Dioxide, Sulphur Dioxide, Carbon Monoxide, and Ozone) for every day from 2000 to 2016 in the United States.

Here are some visualization ideas:

  • What states are the biggest polluters? The least?
  • How much has the US polluted over time? Are they polluting more than 10 years ago or less?
  • See if you can create a choropleth map to show geographically the level of pollution over time!

Nutrition Facts for McDonald’s Menu

Project type: Exploratory Data Analysis

Link to dataset here .

7 Data Science Project Ideas for Aspiring Data Scientists

Photo by XUNO. on Unsplash

This dataset provides a nutrition analysis of every menu item on the US McDonald’s menu, including breakfast, beef burgers, chicken and fish sandwiches, fries, salads, soda, coffee and tea, milkshakes, and desserts.

How many calories does the average McDonald’s value meal contain? Is it really healthier to order grilled chicken instead of crispy? What is the healthiest combination of items that you would have to eat to get your daily nutritional requirements?

Red Wine Quality

Project type: Prediction Modeling

Link to dataset here .

7 Data Science Project Ideas for Aspiring Data Scientists

Photo by Terry Vlisidis on Unsplash

This dataset contains data on various wines, their composition, and their wine quality. This can be a regression or classification problem depending on how you frame it. See if you can predict the quality of a red wine given 11 inputs (fixed acidity, volatile acidity, citric acid, residual sugar, chlorides, free sulfur dioxide, total sulfur dioxide, density, pH, sulfates, and alcohol.

Thanks for Reading!

If you like my work and want to support me, I’d greatly appreciate if you followed me on my social media channels:

  1. The BEST way to support me is by following me on Medium here .
  2. Follow me on Twitter here .
  3. Subscribe to my new YouTube channel here .
  4. Follow me on LinkedIn here .
  5. Sign up on my email list here .
  6. Check out my website terenceshin.com .

以上就是本文的全部内容,希望本文的内容对大家的学习或者工作能带来一定的帮助,也希望大家多多支持 码农网

查看所有标签

猜你喜欢:

本站部分资源来源于网络,本站转载出于传递更多信息之目的,版权归原作者或者来源机构所有,如转载稿涉及版权问题,请联系我们

Python极客项目编程

Python极客项目编程

Mahesh Venkitachalam / 王海鹏 / 人民邮电出版社 / 2017-5 / 69

Python 是一种强大的编程语言,容易学习而且充满乐趣。但掌握了基本知识后,接下来做什么? 本书包含了一组富有想象力的编程项目,它们将引导你用Python 来制作图像和音乐、模拟现实世界的现象,并与 Arduino 和树莓派这样的硬件进行交互。你将学习使用常见的Python 工具和库,如numpy、matplotlib 和pygame, 来完成以下工作: ● 利用参数方程......一起来看看 《Python极客项目编程》 这本书的介绍吧!

在线进制转换器
在线进制转换器

各进制数互转换器

XML 在线格式化
XML 在线格式化

在线 XML 格式化压缩工具

HSV CMYK 转换工具
HSV CMYK 转换工具

HSV CMYK互换工具