Datasets

This page contains links to selected dataset collections that I’ve found. Feel free to email me if you have any suggestions!

Social Network Analysis

Stanford Large Network Dataset Collection

[SNAP is the best!] A substantial collection of datasets describing large networks.

Datasets for Social Network Analysis (Aminer.org)

Microblogging networks, patent data set, online social networks, knowledge linking dataset, mobile dataset, etc.

Network Data Repository

“The first interactive data and network repository with real-time analytics.”

konect - The Koblenz Network Collection

“KONECT (the Koblenz Network Collection) is a project to collect large network datasets of all types in order to perform research in network science and related fields, collected by the Institute of Web Science and Technologies at the University of Koblenz–Landau. KONECT contains over a hundred network datasets of various types, including directed, undirected, bipartite, weighted, unweighted, signed and rating networks.” (From the website.)

Social Computing Data Repository at ASU - Datasets

Network datasets collected from well-known websites, including BlogCatalog, Buzznet, Delicious, Digg, Douban, Flickr, Flixster, Last.fm, Twitter, YouTube and others. Some datasets contain both the contact network and selected group membership information. (Most datasets contain around 100k nodes.)

Datasets | Tore Opsahl

Datasets collected by Tore Opsahl (in tnet format, and some also in UCINET format). The collection contains small networks (32 to 16,726 nodes).

BGU Social Networks Security Research Group

The online social network (OSN) dataset collection of the BGU Social Networks Security Research Group. It contains directed networks (Anybeat, Academia.edu, Google+), undirected networks (TheMarker Cafe), multigraph networks (Students Network, WikiTree), and several Facebook datasets.

Social Computing Research @ MPI-SWS

Other sources of network data

Causal Inference

Last updated: 2018/11/16