Fork me on GitHub

Sexualitics

Data Love. Porn Data.


Unless specified otherwise, the following datasets are released under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.

xHamster

This in an exhaustive dataset of metadatas of all videos published on the site from its creation - 2007 - until february 2013. This represents almost 800,000 entries.

For each entry, the following metadatas are available:

Metadata Description Example % of Dataset
upload_date Day when the video was uploaded 4/30/2011 NA
title Title of the video "Tea party at Dick's house" NA
channels List of the video's tags ['Tea', 'Spoon', 'Sugar'] NA
description Description of the video "What a spoon !" NA
nb_views Number of times the video has been displayed 69 NA
nb_votes Number of users who voted for or against this video 42 NA
nb_comments Number of comments posted on this video 666 NA
runtime Length of the video in seconds 4815 NA
uploader Anonymized identifier of the uploader's username 6f60cbef5b891f80 NA
Download

JSON | CSV - 786,121 entries (50M)


Xnxx

This is a non-exhaustive dataset of metadatas for approximately one third of all videos published on the site until february 2013. This represents almost 1,200,000 entries.

For each entry, the following metadatas are available:

Metadata Description Example % of Dataset
title Title of the video "Tea party at Dick's house" NA
nb_comments Number of comments posted on this video 666 NA
tags List of the video's tags ['Tea', 'Spoon', 'Sugar'] NA

The interest of this dataset is its Tag ecosystem. Unlike other pornographic sites, Uploaders can tag the videos at will. Xnxx has got more than 6,000 tags for describing its videos.

Download

JSON | CSV - 1,166,278 entries (50M)


Derivated Datasets

Category Rankings

Various ranking methods for all categories in xHamster and XNXX

xHamster | XNXX

Links Over/Under representation

Matrix of all categories links strengh

xHamster | XNXX

ABOUT / SOURCE CODE / DATASETS / CITE