Research Datasets
Datasets
- Finding Datasets
- Advanced Google Query for Datasets
- Known Datasets
Datasets to use for Research (See Research wiki)
Find additional datasets behind a login in the community dataset section
## Finding Datasets
- Google Dataset Search
- Google Scholar: Google search power for academic writing
- JSTOR: Digital library of academic journals, books, and primary sources
- ResearchGate: Massive database of academic journals
- Google Dork for Academic Research Resources: `.edu+%22free%22+%28%22research%22+or+%22dataset%22`
- Elicit: AI journal search
- Open Science Framework (OSF): A free and open-source platform designed to support research and collaboration across the research life cycle
- Awesome Public Datasets: A curated list of over 40,000 public datasets across various topics
- Awesome Cyber Threat Datasets: Datasets that provide context, mechanisms, indicators, implications, and actionable advice about an existing or emerging menace or hazard to assets, and that can inform decisions about how to respond to that menace or hazard
- r/datasets
- r/Dissertation
- r/AskAcademia
- r/GradSchool
### Queries for Datasets
"Search_TERM_HERE" site:vision.in.tum.de OR site:www.cdbb.cam.ac.uk OR site:bimportal.scottishfuturestrust.org.uk OR site:digicatapult.org.uk OR site:pewresearch.org OR site:odsc.com OR site:archive.ics.uci.edu OR site:research.tudelft.nl OR site:archive.data.jhu.edu OR site:systems.jhu.edu
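The query above can also be assembled programmatically, which helps when the site list changes or when several search terms need to be tried. The following is a minimal sketch, not a definitive tool: the function names `build_dataset_query` and `build_search_url` are hypothetical, and the `https://www.google.com/search?q=` endpoint is an assumption about how the query is submitted.

```python
from urllib.parse import quote_plus

# Site list copied from the query above.
SITES = [
    "vision.in.tum.de",
    "www.cdbb.cam.ac.uk",
    "bimportal.scottishfuturestrust.org.uk",
    "digicatapult.org.uk",
    "pewresearch.org",
    "odsc.com",
    "archive.ics.uci.edu",
    "research.tudelft.nl",
    "archive.data.jhu.edu",
    "systems.jhu.edu",
]


def build_dataset_query(term, sites=SITES):
    """Return a query string: the quoted term followed by site: filters joined with OR."""
    site_clause = " OR ".join(f"site:{site}" for site in sites)
    return f'"{term}" {site_clause}'


def build_search_url(query):
    """URL-encode the query for a browser (search endpoint is an assumption)."""
    return "https://www.google.com/search?q=" + quote_plus(query)


if __name__ == "__main__":
    query = build_dataset_query("Search_TERM_HERE")
    print(query)
    print(build_search_url(query))
```

Replace `Search_TERM_HERE` with the topic of interest, and add or remove entries in `SITES` to widen or narrow the search.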
## Case Studies and Projects Using Datasets
2 Tidy: Interactive spatio-temporal visualization of worldwide deaths related to various risk factors, specifically air pollution, substance use, and lack of sanitation.

## Known Datasets

URL | Comments | Free (Y/N) | Category | Region |
---|---|---|---|---|
Smithsonian Library Resources | This list includes databases, collections and search tools, selected by Smithsonian Libraries staff, that are freely available via the Internet. | Y | Academic | Global |
CrossSub (Alt Link) | Micro-level, subnational event data on armed conflict and contention around the world | Y | Conflict | Global |
ACLED | Real-time data on the locations, dates, actors, fatalities, and types of all reported political violence and protest events around the world. | N | Conflict | Global |
OSMP | Open Source Munitions Portal (OSMP), an open-source portal launched by Airwars and Arms Research. A useful database, particularly for anyone covering armed conflicts or wars. | Y | Conflict | N/A |
LiveUA | factual reporting of a variety of important topics including conflicts, human rights issues, protests, terrorism, weapons deployment, health matters, natural disasters, and weather related stories, among others, from a vast array of sources | Y | Conflict | Ukraine |
Venezuelan Violence Data | | Y | Conflict | LATAM - Mexico |
World City Database | Database of cities with population and approximate latitude/longitude, found with the Google query `inurl:https://simplemaps.com/data/*-cities "COUNTRY-HERE"` | Y | Country Data | Global |
TradingEconomics | Large database of metrics and indicators by country over time | Y | Country Data | Global |
GitNux Crime Reports | Crime reports and stats | | Crime | Global |
Cloudflare Radar | A view of outages, threats, rankings, and more, based on the massive amount of Cloudflare data | Y | Cyber | Global |
TUM Data | Large collection of data sets for computer vision research | Y | Cyber | |
CEPAL Cyber Attacks | Cyber Attacks in LATAM | Y | Cyber | LATAM |
OECD | Foreign Direct Investment (FDI) Statistics | Y | Finance & Business | Global |
World Bank Data | Economic datasets | Y | Finance & Business | Global |
Scottish Futures Trust ROI Calculator | Calculator that allows the user to calculate the expected return on investment of a building project | Y | Finance & Business | |
Numbeo | Cost of living calculator and comparison tool. Useful for determining average prices around the world. | Y | Finance & Business | Global |
Reportlinker | AI-enabled market intelligence platform | N | Finance & Business | |
BitcoinHeist Ransomware Address Dataset | Contains addresses labeled as belonging to one of the four categories: White, Gray, Black, or Unknown. | Y | Finance & Business | |
Oxford RobotCar Dataset | Dataset for autonomous driving research | Y | General | |
CDBB Data Science and AI Research | Research on data science, AI and machine learning, includes datasets | Y | General | |
Digital Catapult Innovation and Acceleration | Helps businesses bring new products and services to market | Y | General | |
Data Science Conference (ODSC) | Conference that brings together the data science community, including datasets and other resources | N | General | |
Party Facts Datasets | The Party Facts project is a gateway to empirical data about political parties and a modern online platform about parties and their history as recorded in social science datasets. It uses social media technologies to create a collaborative data infrastructure following an approach to collect data successfully applied by the Encyclopedia of Life (EOL). | Y | Politics | Global |
Chapel Hill Expert Survey for Latin America | Administered in 2020 and completed by 160 experts specializing in political parties, the 2020 CHES LA dataset provides information about the positioning of 112 political parties and presidents on political ideology, policy positions, party characteristics, and party linkages. The survey covers political parties and presidents in 12 Latin American countries. | Y | Politics | LATAM |
Interpol Datasets | Police need up-to-date global data on criminals in order to carry out successful international investigations. | N | Politics & Law | Global |
Global Sanctions Dataset | A compilation of international sanctions against countries and entities. | Y | Politics & Law | Global |
Demographic and Social Dataset | | | Populations & People | Global |
GDELT | Monitors the world’s TV broadcast, print, and web news from nearly every corner of every country in over 100 languages and identifies the people, locations, organizations, themes, sources, emotions, counts, quotes, images, and events driving our global society every second of every day, creating a free open platform for computing on the entire world | Y | Populations & People | Global |
FEC | US Voting Data | Y | Populations & People | USA |
Github CIAWorldFactbook | CIA World Fact Book datasets | Y | Populations & People | |
Statista | Insights and facts across 170 industries and 150+ countries | Y | Populations & People | |
Princeton ESOC | Event, public opinion, and spatial data | Y | Populations & People | |
Heavy.ai:ships | Shipping data | Y | Populations & People | |
Latinobarómetro | | | Populations & People | LATAM |
Mexican Socio-Economic Dataset | | Y | Populations & People | LATAM - Mexico |
IgnitionRobotics | Google project to 3D-model objects | Y | Scans | |
JHU Space Systems | Satellite datasets and tools for space systems research | Y | Science | |
OmniSci Tweet Map | Tweet Data Map | Y | Social Media | |
Hamilton Dashboard | Large volume of tweet data, especially covering Russia, China, and Iran | Y | Social Media | Russia, China, and Iran |
Pew Research Center Datasets | Datasets on various social and political topics. | Y | Social Science | USA |
GitNux | | | Various | Global |
Our World In Data (OWID GitHub Repo) | Research and data to make progress against the world’s largest problems. | Y | Various | Global |
Japanese Datasets | Japanese e-Stat | Y | Various | Japan |
JHU Data Archive | Datasets on social, health, and economic topics | Y | Various | |
Datasets from FiveThirtyEight | Datasets used in FiveThirtyEight articles and graphics. | Y | Various | USA |

Convert to Excel or CSV if needed: https://tableconvert.com/markdown-to-excel
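
For offline use, the table can also be converted locally. The snippet below is a rough sketch that assumes the table is saved as Markdown text in the layout used on this page (no leading pipe, one trailing pipe per row, a separator row on the second line). The helper names `split_row`, `parse_table`, and `to_csv` are hypothetical, and short rows are padded with empty cells rather than rejected.

```python
import csv
import io


def split_row(line):
    """Split one Markdown table row into stripped cell values."""
    line = line.strip()
    if line.startswith("|"):
        line = line[1:]
    if line.endswith("|"):
        line = line[:-1]
    return [cell.strip() for cell in line.split("|")]


def parse_table(markdown):
    """Return (header, rows); rows are dicts keyed by the header cells."""
    lines = [ln for ln in markdown.splitlines() if ln.strip()]
    header = split_row(lines[0])
    rows = []
    for line in lines[2:]:  # lines[1] is the ---|---| separator row
        cells = split_row(line)
        # Pad short rows and drop extra cells so every row matches the header.
        cells = (cells + [""] * len(header))[: len(header)]
        rows.append(dict(zip(header, cells)))
    return header, rows


def to_csv(header, rows):
    """Serialize the parsed rows as CSV text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=header, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


SAMPLE = """URL | Comments | Free (Y/N) | Category | Region |
---|---|---|---|---|
FEC | US Voting Data | Y | Populations & People | USA |
TUM Data | Large collection of data sets for computer vision research | Y | Cyber | |
"""

if __name__ == "__main__":
    header, rows = parse_table(SAMPLE)
    free_rows = [row for row in rows if row["Free (Y/N)"] == "Y"]
    print(to_csv(header, free_rows))
```

Once parsed, the rows can be filtered by `Category` or `Region` before export, as the `free_rows` line does for the free/paid column.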