While Kaggle is a popular destination for datasets, it’s not the only place to find valuable data for your data science and analytics practice. Here are 20+ other top data science sources to expand your dataset library:
- UNData: Access a comprehensive statistical database from the United Nations.
- Datasimplifier: Find curated datasets for data analytics.
- Tableau Public Data Sets: Explore a variety of datasets for use with Tableau.
- US Census Bureau: Dive into detailed demographic data from the U.S. Census.
- Amazon AWS Dataset: Discover large datasets across multiple domains hosted on AWS.
- UC Irvine Machine Learning Repository: Explore a rich collection of datasets specifically for machine learning.
- USA Open Data: Access a wide array of public datasets from the U.S. government.
- Wikipedia Data Sets: Utilize datasets derived from Wikipedia.
- World Bank Dataset: Find economic and development data from the World Bank.
- World Health Organization: Access global health data from WHO.
- Awesome Public Data Sources: A curated list of public data sources.
- Google Dataset: Search through a vast collection of scholarly datasets.
- Country Codes List: A comprehensive list of country codes for reference.
- FiveThirtyEight: Access data used in FiveThirtyEight’s analyses.
- BuzzFeed News: Explore datasets used in BuzzFeed News investigations.
- Kaggle: While it’s widely known, Kaggle remains a valuable resource for datasets.
- Socrata: Discover data sets from local governments and other public agencies.
- GitHub: Explore user-contributed datasets and repositories.
- Google Dataset Search: Use Google’s tool to search for datasets across the web.
- Data.gov: Find a vast collection of datasets provided by the U.S. government.
- Datahub: A community-driven data platform offering various datasets.