2022 Russian Invasion of Ukraine
VIINA/ВІЙНА/ВОЙНА/WAR (Violent Incident Information from News Articles) is a near-real time multi-source event data system for the 2022 Russian Invasion of Ukraine. These data are based on news reports from Ukrainian and Russian media, which were geocoded and classified into standard conflict event categories through machine learning. These data are GIS-ready, with temporal precision down to the minute. Each observation is accompanied by full source information, text and URLs. In addition to raw events, VIINA also includes data on territorial control, at the level of individual populated places. VIINA is updated daily, and is freely available for use by students, journalists, policymakers, and everyday researchers.
Github repository: github.com/zhukovyuri/VIINA
Zhukov, Yuri (2022). “Near-Real Time Analysis of War and Economic Activity during Russia’s Invasion of Ukraine.” Journal of Comparative Economics 51 (4): 1232-1243 doi.org/10.1016/j.jce.2023.06.003
Download latest VIINA data
Using an automated web scraping routine (which runs every 6 hours), VIINA extracts the text of news reports published by each source and their associated metadata (publication time and date, web urls). Using natural language processing, the system extracts and geocodes location names mentioned in each news item. A transformer model then classifies each event report into several pre-defined categories.
Dataset | Link | Format |
---|---|---|
Raw event reports, 2022 (locations, dates, urls, headlines) | event_info_latest_2022.zip | csv (zipped) |
Raw event reports, 2023 (locations, dates, urls, headlines) | event_info_latest_2023.zip | csv (zipped) |
Raw event reports, 2024 (locations, dates, urls, headlines) | event_info_latest_2024.zip | csv (zipped) |
Event reports labeled by actor and tactic, 2022 (from BERT model) | event_labels_latest_2022.zip | csv (zipped) |
Event reports labeled by actor and tactic, 2023 (from BERT model) | event_labels_latest_2023.zip | csv (zipped) |
Event reports labeled by actor and tactic, 2024 (from BERT model) | event_labels_latest_2024.zip | csv (zipped) |
De-duplicated event reports and labels, 2022 ("one-per-day" filter) | event_1pd_latest_2022.zip | csv (zipped) |
De-duplicated event reports and labels, 2023 ("one-per-day" filter) | event_1pd_latest_2023.zip | csv (zipped) |
De-duplicated event reports and labels, 2024 ("one-per-day" filter) | event_1pd_latest_2024.zip | csv (zipped) |
Territorial control, 2022 | control_latest_2022.zip | csv (zipped) |
Territorial control, 2023 | control_latest_2023.zip | csv (zipped) |
Territorial control, 2024 | control_latest_2024.zip | csv (zipped) |
Ukrainian populated places | gn_UA_tess.geojson | geojson (polygons) |