Data Is Plural

... is a weekly newsletter of useful/curious datasets.

2023.03.22 edition

Civilian harm in Ukraine, aid for Ukraine, municipal incorporations, political podcasts, and stop signs.

Civilian harm in Ukraine. Researchers at Bellingcat and contributors to its Global Authentication Project have assembled a map and dataset of 1,000+ incidents “that have resulted in potential civilian impact or harm since Russia began its invasion of Ukraine.” They include incidents “where rockets or missiles struck civilian areas,” “where attacks have resulted in the destruction of civilian infrastructure,” and/or where visual evidence depicts civilian injuries or “immobile civilian bodies.” The information, collected from public sources and vetted by Bellingcat, includes each incident’s date, location, description, sources, type of area affected, and type of weapon system (if known). [h/t Philip Bump]

Aid for Ukraine. Christoph Trebesch et al.’s Ukraine Support Tracker “lists and quantifies military, financial and humanitarian aid to Ukraine in the context of the Russia-Ukraine war.” The 1,400+ entries in the tracker’s dataset include contributions and commitments from 40 governments, plus several European Union institutions. (It does not include aid from NGOs and other non-state entities.) Each entry indicates the country, announcement date, type of aid, total value, description, sources, and more. The tracker’s next update is scheduled for March 29.

Municipal incorporations. Christopher B. Goodman, a professor of public administration, has consulted a range of state-level sources to compile a dataset listing the year of incorporation for 18,000+ municipalities in the United States. The dataset, which covers nearly 96% of all active municipalities, also provides each place’s name, state, coordinates, canonical ID in the Census, and more. Read more: In a Twitter thread, Goodman explains why he undertook the effort and shares a couple of visualizations. [h/t Maggie Lee]

Political podcasts. The Popular Political Podcast Dataset, developed by the Brookings Institution’s Valerie Wirtschafter and Chris Meserole, covers 50,000+ episodes from 100+ “prominent political podcast series” — the latter based on Apple Podcasts’ popularity rankings and its “You Might Also Like” recommendations. Updated daily and explorable online, the dataset provides each episode’s name, description, air date, and URL, plus the series name, partisan leaning, and Apple Podcasts category.

Stop signs. The City of Los Angeles publishes the location and orientation of 50,000+ local stop signs (plus a few yield signs). Other cities offering similar datasets include Houston, San Francisco, Detroit, Topeka, Menlo Park, and London, Ontario. Related: OpenStreetMap’s dataset features nearly 1.4 million stop signs located across the world. [h/t Matt Stiles]