Data related to the paper "Studying social unrest through the lens of social media"
DOI: 10.4121/649e8f5d-8e40-4ab7-9d07-b5ef53d810f0
Datacite citation style
Dataset
Categories
Time coverage 2023-06-23 19:17:18 to 2023-07-16 02:49:31.353
Licence CC BY 4.0
Interoperability
Dataset corresponding to the paper "Studying social unrest through the lens of social media".
107,674 geolocated visual posts from a social media were collected during and after the 'Nahel Merzouk' riots in the summer 2023 in 7 French cities. These posts were fed to a computer vision model with the objective of identifying riot-related posts. This dataset contains the metadata (date, time, and location) of those posts along with the label of the posts (according to the model). Riot-related posts are then clustered into "events", based on their spatiotemporal proximity (see paper for more details).
Columns:
"timestamp" (TIMESTAMP): Date and time of the posts
"latitude" (REAL): Latitude at which the post was published
"longitude" (REAL): Longitude at which the post was published
"pred_class" (INTEGER): Binary variable with value 1 if it represents a riot, 0 otherwise
"event" (TEXT): Event associated to the post, structured as follows:
"No event" if the post is not marked as riot-related
"day_city_id" with "day" being the day of the month associated to the event, such as "2", "city" being the city in which the event happened, such as "Paris", "id" being an integer. "29_Marseille_0" corresponds to event "0" happening in Marseille on June 29th 2023. If the value of the id is "-1", the post could not be associated to any event.
History
- 2024-12-12 first online
- 2025-06-06 published, posted
Publisher
4TU.ResearchDataFormat
sqlite3Organizations
TU Delft, Faculty of Civil Engineering and Geosciences, Department of Transport and PlanningDATA
Files (2)
- 1,768 bytesMD5:
5d8cb32612f41aab3ecc309a6d3f4a3c
README.md - 6,791,168 bytesMD5:
a8dd8e3c4d05580803ba716da7c6ae1f
posts.db -
download all files (zip)
6,792,936 bytes unzipped