Name File Type Size Last Modified
retweets_congress_115th.csv.gz application/gzip 115.2 MB 05/14/2020 02:48:PM

Project Citation: 

Hemphill, Libby, Schöpke-Gonzalez, Angela M., Hodge, Caroline, and Bredernitz, Chris. Who Did the 115th US Congress Retweet¿? Ann Arbor, MI: Inter-university Consortium for Political and Social Research [distributor], 2020-05-14. https://doi.org/10.3886/E108303V2

Project Description

Summary:  View help for Summary This dataset includes the retweets posted on Twitter by accounts associated with members of the US Congress during the 115th Congress (2017-2018). The list of accounts combines two sources: 
Tweets were collected using Twitter's Search API through the twitter_user_collector Python script (https://github.com/casmlab/twitter_user_collector).

We filtered all tweets posted during the 115th Congress, leaving only those that have an associated attribute "retweeted_status", which indicates that the given CM's tweet is a retweet of another tweet. These retweets number 209,856 during the 115th Congress, made by 38,131 unique Twitter accounts.

We preserved and renamed metadata these tweets provided through Twitter's API, including the fields  'tweet_id_str', 'full_text', 'user_id_str', 'user_screen_name', 'user_followers_count', 'created_at', 'retweet_count', 'retweeted_status', and 'year' (extracted from 'created_at').

Beyond that tweet metadata provided through Twitter’s API, we collected additional demographic metadata for as many CMs as possible of those featured in our Tweet collection by using The United States Project's crowdsourced list of current legislators’ official Twitter handles, and associated metadata fields identifying a legislator’s unique bioguide ID ('bioguide' field),
name (‘name’ field), chamber (‘chamber’ field), party (‘party’ field), state represented (‘state’ field), gender (‘gender’ field), and birthday (‘birthday’ field). For those CMs not included in The United States Project, we manually searched for information to fill each of these metadata fields.

Based on which state each of these CMs represents, we assigned each CM a region (‘region’ field) based on those U.S. regional divisions outlined by Karl and Koss in their 1984 paper (https://repository.library.noaa.gov/view/noaa/10238) and which is also used by the U.S. National Centers for Environmental Information. For those states not captured by Karl and Koss’ regions, we made determinations ourselves and assigned them according to climatological and cultural contexts. In doing so, we developed an additional regional label, “Islands”. Those states or territories that we independently assigned include American Samoa, Virgin Islands, Puerto Rico, Hawaii, District of Columbia, and Alaska.

We determined age (‘age’ field) at the time of dataset creation (Jan. 10, 2020) according to CMs’ reported birthdays. We then grouped these ages into those age buckets 30-39, 40-49, 50-59, 60-69, 70-79, 80-89 (‘age_bucket’ field).

The OpenICPSR dataset features tweets by 520 CMs with this associated metadata.

Finally, we include fields which describe the original tweet that the CM retweeted and the user who posted it. We include that original poster’s Twitter user ID ('rt_user_id' field), Twitter screen name ('rt_screen_name' field), number of Twitter followers ('rt_followers_count' field), and user bio ('rt_bio' field). We extracted these fields from the JSON value included in the Twitter API's 'retweeted_status' field.





Scope of Project

Subject Terms:  View help for Subject Terms Congress; twitter
Geographic Coverage:  View help for Geographic Coverage United States
Time Period(s):  View help for Time Period(s) 1/1/2017 – 12/31/2017 (Calendar Year 2017)
Universe:  View help for Universe 462 Twitter accounts related to the US Congress
  • 447 members of the United States Congress (MCs)
  • 15 accounts associated with (a) former or deceased MCs or (b) campaigns associated with MCs
Data about members of Congress (e.g., party, chamber, gender, birthdate)

Data Type(s):  View help for Data Type(s) geographic information system (GIS) data; other; text

Methodology

Data Source:  View help for Data Source Twitter Search API
Collection Mode(s):  View help for Collection Mode(s) other
Unit(s) of Observation:  View help for Unit(s) of Observation retweets

Related Publications

Published Versions

Export Metadata

Report a Problem

Found a serious problem with the data, such as disclosure risk or copyrighted content? Let us know.

This material is distributed exactly as it arrived from the data depositor. ICPSR has not checked or processed this material. Users should consult the investigator(s) if further information is desired.