Who Did the 115th US Congress Retweet ?
Principal Investigator(s): View help for Principal Investigator(s) Libby Hemphill, University of Michigan; Angela M. Schöpke-Gonzalez, University of Michigan; Caroline Hodge, University of Michigan; Chris Bredernitz, University of Michigan
Version: View help for Version V2
Name | File Type | Size | Last Modified |
---|---|---|---|
retweets_congress_115th.csv.gz | application/gzip | 115.2 MB | 05/14/2020 02:48:PM |
Project Citation:
Project Description
- Justin Littman's list (https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/UIVHQR)
- The United States project list (https://github.com/unitedstates/congress-legislators)
We filtered all tweets posted during the 115th Congress, leaving only those that have an associated attribute "retweeted_status", which indicates that the given CM's tweet is a retweet of another tweet. These retweets number 209,856 during the 115th Congress, made by 38,131 unique Twitter accounts.
We preserved and renamed metadata these tweets provided through Twitter's API, including the fields 'tweet_id_str', 'full_text', 'user_id_str', 'user_screen_name', 'user_followers_count', 'created_at', 'retweet_count', 'retweeted_status', and 'year' (extracted from 'created_at').
Beyond that tweet metadata provided through Twitter’s API, we collected additional demographic metadata for as many CMs as possible of those featured in our Tweet collection by using The United States Project's crowdsourced list of current legislators’ official Twitter handles, and associated metadata fields identifying a legislator’s unique bioguide ID ('bioguide' field), name (‘name’ field), chamber (‘chamber’ field), party (‘party’ field), state represented (‘state’ field), gender (‘gender’ field), and birthday (‘birthday’ field). For those CMs not included in The United States Project, we manually searched for information to fill each of these metadata fields.
Based on which state each of these CMs represents, we assigned each CM a region (‘region’ field) based on those U.S. regional divisions outlined by Karl and Koss in their 1984 paper (https://repository.library.noaa.gov/view/noaa/10238) and which is also used by the U.S. National Centers for Environmental Information. For those states not captured by Karl and Koss’ regions, we made determinations ourselves and assigned them according to climatological and cultural contexts. In doing so, we developed an additional regional label, “Islands”. Those states or territories that we independently assigned include American Samoa, Virgin Islands, Puerto Rico, Hawaii, District of Columbia, and Alaska.
We determined age (‘age’ field) at the time of dataset creation (Jan. 10, 2020) according to CMs’ reported birthdays. We then grouped these ages into those age buckets 30-39, 40-49, 50-59, 60-69, 70-79, 80-89 (‘age_bucket’ field).
The OpenICPSR dataset features tweets by 520 CMs with this associated metadata.
Finally, we include fields which describe the original tweet that the CM retweeted and the user who posted it. We include that original poster’s Twitter user ID ('rt_user_id' field), Twitter screen name ('rt_screen_name' field), number of Twitter followers ('rt_followers_count' field), and user bio ('rt_bio' field). We extracted these fields from the JSON value included in the Twitter API's 'retweeted_status' field.
Scope of Project
- 447 members of the United States Congress (MCs)
- 15 accounts associated with (a) former or deceased MCs or (b) campaigns associated with MCs
Methodology
Related Publications
Published Versions
Found a serious problem with the data, such as disclosure risk or copyrighted content? Let us know.
This material is distributed exactly as it arrived from the data depositor. ICPSR has not checked or processed this material. Users should consult the investigator(s) if further information is desired.