
internetarchive_2017-10-12_03-32-06.xlsx
From: NodeXLExcelAutomator
Uploaded on: October 12, 2017
Short Description: internetarchive via NodeXL http://bit.ly/2i8lhKv
@internetarchive
@ieeehistory
@smp0312
@mfab974
@dbraz5
@supernovaautism
@via_shahar

Description:
The graph represents a network of 1,641 Twitter users whose tweets in the requested range contained "internetarchive", or who were replied to or mentioned in those tweets. The network was obtained from the NodeXL Graph Server on Thursday, 12 October 2017 at 10:33 UTC.

The requested start date was Thursday, 12 October 2017 at 00:01 UTC and the maximum number of days (going backward) was 14.

The maximum number of tweets collected was 5,000.

The tweets in the network were tweeted over the 13-day, 23-hour, 23-minute period from Thursday, 28 September 2017 at 00:35 UTC to Wednesday, 11 October 2017 at 23:59 UTC.

Tweets that were mentioned in this data set were also collected from earlier time periods, so the data may span a longer period than the range stated above.

There is an edge for each "replies-to" relationship in a tweet, an edge for each "mentions" relationship in a tweet, and a self-loop edge for each tweet that neither replies to nor mentions another user.
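
The edge rules above can be sketched roughly as follows. This is an illustrative sketch only, not NodeXL's importer: the tweet dictionary keys ("author", "reply_to", "mentions") and the function name are hypothetical, and no de-duplication between reply and mention edges is attempted.

```python
def tweet_edges(tweets):
    """Turn tweets into directed (source, target, relationship) edges.

    Assumed (hypothetical) input: each tweet is a dict with an "author",
    an optional "reply_to" user, and an optional list of "mentions".
    """
    edges = []
    for tweet in tweets:
        author = tweet["author"]
        reply_to = tweet.get("reply_to")
        mentions = tweet.get("mentions", [])

        if reply_to:
            edges.append((author, reply_to, "replies-to"))
        for mentioned in mentions:
            edges.append((author, mentioned, "mentions"))
        # A tweet with no reply target and no mentions becomes a self-loop.
        if not reply_to and not mentions:
            edges.append((author, author, "tweet"))
    return edges


example = [
    {"author": "alice", "reply_to": "internetarchive"},
    {"author": "bob", "mentions": ["internetarchive", "textfiles"]},
    {"author": "carol"},
]
edges = tweet_edges(example)
```

Because the graph is directed, each edge points from the tweeting user to the user replied to or mentioned.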

The graph is directed.

The graph's vertices were grouped by cluster using the Clauset-Newman-Moore cluster algorithm.
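
The Clauset-Newman-Moore algorithm greedily merges groups so as to increase modularity. The quantity it optimizes can be sketched as below for an undirected edge list. This is a minimal illustration with hypothetical names, and it may differ from how NodeXL computes the modularity it reports (for example in how directed or duplicate edges are handled).

```python
from collections import defaultdict


def modularity(edges, community_of):
    """Modularity Q of a partition of an undirected graph.

    edges: list of (u, v) pairs; community_of: dict mapping vertex -> group.
    Q = sum over groups of (internal_edges / m) - (group_degree / 2m)^2.
    """
    m = len(edges)
    if m == 0:
        return 0.0
    internal = defaultdict(int)   # edges with both ends in the group
    degree = defaultdict(int)     # total degree of the group's vertices
    for u, v in edges:
        cu, cv = community_of[u], community_of[v]
        degree[cu] += 1
        degree[cv] += 1
        if cu == cv:
            internal[cu] += 1
    return sum(internal[c] / m - (degree[c] / (2 * m)) ** 2 for c in degree)


# Two triangles joined by one edge, split into their natural groups.
toy_edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
toy_groups = {0: "A", 1: "A", 2: "A", 3: "B", 4: "B", 5: "B"}
q = modularity(toy_edges, toy_groups)
```

Putting every vertex in a single group gives a modularity of zero, so a positive value indicates more within-group edges than the degree-based baseline.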

The graph was laid out using the Harel-Koren Fast Multiscale layout algorithm.


Overall Graph Metrics
Vertices : 1641
Unique Edges : 2500
Edges With Duplicates : 1187
Total Edges : 3687
Self-Loops : 141
Reciprocated Vertex Pair Ratio : 0.030441400304414
Reciprocated Edge Ratio : 0.0590841949778434
Connected Components : 89
Single-Vertex Connected Components : 61
Maximum Vertices in a Connected Component : 1483
Maximum Edges in a Connected Component : 3488
Maximum Geodesic Distance (Diameter) : 8
Average Geodesic Distance : 2.406889
Graph Density : 0.00100622761255035
Modularity : 0.467076
NodeXL Version : 1.0.1.388
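
Several of the metrics above are related by simple arithmetic, which the following sketch checks. The counts of 2,628 connected vertex pairs and 80 reciprocated pairs are not stated in the report; they are assumptions, inferred as the integers that reproduce the reported ratios. The density formula used is the standard one for a directed graph without self-loops or duplicate edges.

```python
vertices = 1641
unique_edges = 2500
edges_with_duplicates = 1187
total_edges = unique_edges + edges_with_duplicates  # reported as 3687

# Assumed values (inferred, not in the report): ordered vertex pairs
# joined in at least one direction, and pairs joined in both directions.
connected_pairs = 2628
reciprocated_pairs = 80

# Distinct directed edges once duplicates and self-loops are set aside:
# every connected pair has one edge, reciprocated pairs have a second.
distinct_edges = connected_pairs + reciprocated_pairs  # 2708

vertex_pair_ratio = reciprocated_pairs / connected_pairs
edge_ratio = 2 * reciprocated_pairs / distinct_edges
density = distinct_edges / (vertices * (vertices - 1))

print(total_edges, vertex_pair_ratio, edge_ratio, density)
```

Under these assumptions the two reciprocity figures are tied together: if r is the reciprocated vertex pair ratio, the reciprocated edge ratio equals 2r / (1 + r).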

Top Influencers: Top 10 Vertices, Ranked by Betweenness Centrality
Top URLs
Top URLs in Tweet in Entire Graph:
[52] https://twitter.com/internetarchive/status/917836330624729088
[42] https://archive.org/details/gazetteofindia?sort=date
[22] https://fivethirtyeight.com/features/the-media-really-has-neglected-puerto-rico/
[18] https://www.vox.com/2017/10/2/16401614/fox-news-puerto-rico-charts
[17] https://archive.org/create/
[17] https://twitter.com/arxiverssf/status/915197151738781697
[14] https://twitter.com/internetarchive/status/918192348143616000
[13] https://blog.archive.org/2017/10/10/books-from-1923-to-1941-now-liberated/
[12] http://blog.archive.org/2017/10/05/wayback-machine-playback-now-with-timestamps/
[11] http://repository.wellesley.edu/cgi/viewcontent.cgi?article=1158&context=scholarship

Top URLs in Tweet in G1:
[10] http://blog.archive.org/2017/10/10/books-from-1923-to-1941-now-liberated/
[8] https://blog.archive.org/2017/10/10/books-from-1923-to-1941-now-liberated/
[6] https://arstechnica.com/tech-policy/2017/10/internet-archive-puts-full-out-of-print-books-from-20s-and-30s-online/
[6] https://boingboing.net/2017/10/10/library-public-domain.html
[6] https://archive.org/donate/
[5] https://archive.org/details/last20
[5] https://archive.org/details/georgeblood
[4] https://archive.org/details/mlbplayoffs
[3] http://tempsreel.nouvelobs.com/en-direct/a-chaud/43043-culture-etats-unis-litterature-milliers-livres-anterieurs.html
[3] https://twitter.com/internetarchive/status/917836330624729088

Top URLs in Tweet in G2:
[21] https://fivethirtyeight.com/features/the-media-really-has-neglected-puerto-rico/
[14] https://www.vox.com/2017/10/2/16401614/fox-news-puerto-rico-charts
[6] https://envirodatagov.org/wp-content/uploads/2017/10/WM-CCR-19-FEMA-Hurricane-Maria-171006.pdf
[6] https://envirodatagov.org
[6] https://www.washingtonpost.com/news/post-politics/wp/2017/10/05/fema-removes-statistics-about-drinking-water-access-and-electricity-in-puerto-rico-from-website/
[5] https://projects.propublica.org/politwoops/user/POTUS
[4] https://soundcloud.com/innovationhub/kahle-internet?platform=hootsuite
[3] https://archive.org/details/CNNW_20171001_150000_Reliable_Sources/start/360/end/399
[3] http://blogs.wgbh.org/innovation-hub/2017/4/7/kahle-internet/
[2] https://www.publishersweekly.com/pw/by-topic/digital/content-and-e-books/article/74877-the-business-of-making-e-books-free.html

Top URLs in Tweet in G3:
[9] http://repository.wellesley.edu/cgi/viewcontent.cgi?article=1158&context=scholarship
[9] http://blog.archive.org/2017/10/05/wayback-machine-playback-now-with-timestamps/
[2] https://envirodatagov.org/wp-content/uploads/2017/10/WM-CCR-19-FEMA-Hurricane-Maria-171006.pdf
[2] https://envirodatagov.org
[2] https://www.washingtonpost.com/news/post-politics/wp/2017/10/05/fema-removes-statistics-about-drinking-water-access-and-electricity-in-puerto-rico-from-website/
[2] https://simonwillison.net/2017/Oct/8/missing-content/
[1] https://twitter.com/i/web/status/916396877666570240
[1] https://twitter.com/i/web/status/916406503388655617
[1] https://twitter.com/i/web/status/916315121756049408
[1] http://files.beeldengeluid.nl/pdf/AllardOelen_ArchivingDynamicWebsites.pdf

Top URLs in Tweet in G4:
[12] https://twitter.com/internetarchive/status/917836330624729088
[10] http://www.icomu-master.info/
[10] http://archive.org/web/
[6] https://twitter.com/internetarchive/status/918192348143616000
[2] http://web.archive.org/web/20101222210558/http://ueno.cool.ne.jp:80/hero99/
[1] http://web.archive.org/web/*/yuriko.or.jp
[1] https://twitter.com/internetarchive/status/913518913991860224
[1] https://archive.org/details/floragraecasive10sibt
[1] https://twitter.com/i/web/status/913680378740789249
[1] https://twitter.com/i/web/status/913785425860468737

Top URLs in Tweet in G5:
[1] https://twitter.com/i/web/status/917492955899817984
[1] https://archive.org/details/gazetteofindia?sort=date
[1] https://www.eventbrite.com/e/aaron-swartz-day-2017-tickets-35394056576?aff=estw
[1] https://www.vox.com/2017/10/2/16401614/fox-news-puerto-rico-charts

Top URLs in Tweet in G6:
[41] https://archive.org/details/gazetteofindia?sort=date
[5] https://geekup.in/2017/right-to-information-and-knowledge
[3] https://blog.archive.org/2017/10/10/books-from-1923-to-1941-now-liberated/
[2] https://boingboing.net/2017/10/10/library-public-domain.html
[2] https://blog.archive.org/2017/08/30/the-internet-archives-annual-bash-come-celebrate-with-us/
[1] https://www.vox.com/2017/10/2/16401614/fox-news-puerto-rico-charts
[1] https://twitter.com/i/web/status/917158774279774208
[1] https://twitter.com/i/web/status/917875888187891712
[1] https://twitter.com/i/web/status/917876636485222405
[1] https://twitter.com/i/web/status/918217675154714624

Top URLs in Tweet in G7:
[1] https://twitter.com/i/web/status/914905804272173057
[1] https://twitter.com/thememoryhole2/status/916089601319624704
[1] https://www.swarmapp.com/c/juetLjM5Mub
[1] https://twitter.com/i/web/status/913355946155442176
[1] https://blog.archive.org/2011/08/17/scanning-a-braille-playboy/
[1] https://twitter.com/BorrisInABox/status/915610071001976833
[1] https://twitter.com/i/web/status/913356183053979650
[1] https://twitter.com/i/web/status/918014034884026368
[1] https://twitter.com/i/web/status/917518938384322560
[1] https://twitter.com/i/web/status/917531452916592640

Top URLs in Tweet in G8:
[7] http://tedium.co/2017/09/28/eudora-email-history/
[1] https://twitter.com/internetarchive/status/918192348143616000
[1] https://twitter.com/internetarchive/status/917836330624729088
[1] https://twitter.com/i/web/status/917860037460156416
[1] https://twitter.com/i/web/status/916316490445574144
[1] http://repository.wellesley.edu/cgi/viewcontent.cgi?article=1158&context=scholarship
[1] https://twitter.com/i/web/status/915210948637548546
[1] https://archive.org/details/warrenmethodofex00warr
[1] https://twitter.com/i/web/status/913552211548016641
[1] https://twitter.com/i/web/status/913203802504880128

Top URLs in Tweet in G9:
[17] https://archive.org/create/
[17] https://twitter.com/arxiverssf/status/915197151738781697
[1] https://twitter.com/i/web/status/914019135738335232
[1] https://twitter.com/i/web/status/915198024934141952

Top URLs in Tweet in G10:
[1] https://www.vox.com/2017/10/2/16401614/fox-news-puerto-rico-charts
[1] http://programminghistorian.org/lessons/data-mining-the-internet-archive
[1] http://blog.archive.org/2017/10/05/wayback-machine-playback-now-with-timestamps/

Top Domains
Top Domains in Tweet in Entire Graph:
[705] archive.org
[321] twitter.com
[22] fivethirtyeight.com
[18] vox.com
[18] envirodatagov.org
[16] blogspot.com
[11] wellesley.edu
[10] boingboing.net
[10] icomu-master.info
[9] washingtonpost.com

Top Domains in Tweet in G1:
[390] archive.org
[69] twitter.com
[6] arstechnica.com
[6] boingboing.net
[5] google.com
[4] eventbrite.com
[3] nouvelobs.com
[2] envirodatagov.org
[2] wiley.com
[2] apple.com

Top Domains in Tweet in G2:
[21] twitter.com
[21] fivethirtyeight.com
[14] vox.com
[12] envirodatagov.org
[8] archive.org
[6] washingtonpost.com
[5] propublica.org
[4] soundcloud.com
[3] wgbh.org
[2] publishersweekly.com

Top Domains in Tweet in G3:
[11] archive.org
[9] wellesley.edu
[6] twitter.com
[4] envirodatagov.org
[2] washingtonpost.com
[2] simonwillison.net
[1] beeldengeluid.nl
[1] blogspot.com
[1] kb.nl

Top Domains in Tweet in G4:
[52] archive.org
[34] twitter.com
[10] icomu-master.info
[2] trendolizer.com
[1] blogtecnologia.es
[1] github.com
[1] twilog.org
[1] co.jp
[1] conditioncharge.cal
[1] boingboing.net

Top Domains in Tweet in G5:
[1] twitter.com
[1] archive.org
[1] eventbrite.com
[1] vox.com

Top Domains in Tweet in G6:
[47] archive.org
[8] twitter.com
[5] geekup.in
[2] boingboing.net
[1] vox.com

Top Domains in Tweet in G7:
[12] twitter.com
[1] swarmapp.com
[1] archive.org

Top Domains in Tweet in G8:
[7] twitter.com
[7] tedium.co
[1] wellesley.edu
[1] archive.org

Top Domains in Tweet in G9:
[19] twitter.com
[17] archive.org

Top Domains in Tweet in G10:
[1] vox.com
[1] programminghistorian.org
[1] archive.org

Top Hashtags
Top Hashtags in Tweet in Entire Graph:
[26] internetarchive
[21] publicdomain
[15] copyright
[15] heritrix
[14] deadonthisdate
[12] webarchiving
[11] vinylrecords
[11] freefilmoftheday
[8] loosemeatsndwch
[7] ween

Top Hashtags in Tweet in G1:
[14] copyright
[4] publicdomain
[4] lasvegas
[4] uk
[3] ballachulish
[3] gratefuldead
[3] nationalcoffeeday
[2] iatimemachine
[2] bannedbooksweek
[2] podcast

Top Hashtags in Tweet in G2:
[3] podcast
[3] followfriday
[2] webarchiving
[2] ipres2017
[2] webbys
[1] heritrix
[1] ona17mw

Top Hashtags in Tweet in G3:
[14] heritrix
[10] webarchiving
[3] rvh2017
[2] access
[2] webarchives
[1] jupyternotebook
[1] nationalpoetryday
[1] tennyson

Top Hashtags in Tweet in G4:
[8] internetarchive
[1] ojibwa
[1] ruscha
[1] fawltytowers
[1] history
[1] nickfolk
[1] scandal
[1] htgawm
[1] tonyromo
[1] blackhawks

Top Hashtags in Tweet in G5:
[2] sanfrancisco

Top Hashtags in Tweet in G7:
[2] fronteers
[1] internetarchive
[1] waybackmachine

Top Hashtags in Tweet in G9:
[1] arxivemelmoment

Top Hashtags in Tweet in G10:
[1] internetarchive

Top Words
Top Words in Tweet in Entire Graph:
[1051] Words in Sentiment List#1: Positive
[408] Words in Sentiment List#2: Negative
[1] Words in Sentiment List#3: Angry/Violent
[29519] Non-categorized Words
[30978] Total Words
[1727] internetarchive
[362] archive
[284] https
[244] internet
[236] now

Top Words in Tweet in G1:
[828] internetarchive
[207] archive
[182] 1923
[171] 1941
[161] internet
[154] free
[150] https
[147] published
[146] make
[144] libraries

Top Words in Tweet in G2:
[109] internetarchive
[62] tvnewsarchive
[38] h
[32] internetarchive's
[30] coverage
[30] third
[30] eye
[28] wayback
[26] markgraham
[25] puerto

Top Words in Tweet in G3:
[90] internetarchive
[45] machine
[44] wayback
[21] adalerner
[21] missing
[21] blog
[20] now
[20] internetarchive's
[20] simonw
[20] recovered

Top Words in Tweet in G4:
[32] internetarchive
[25] internetarchiveさんから
[24] https
[23] t
[23] co
[13] today's
[13] archiving
[11] m
[10] internet
[10] archive

Top Words in Tweet in G5:
[59] internetarchive
[51] tv
[51] data
[51] https
[50] media
[50] really
[50] neglected
[50] puerto
[50] rico
[50] uses

Top Words in Tweet in G6:
[75] internetarchive
[54] carlmalamud
[43] running
[42] now
[42] simultaneous
[42] upload
[41] 65
[41] 015
[41] issues
[41] gazette

Top Words in Tweet in G7:
[62] internetarchive
[41] textfiles
[10] mozilla
[9] bryanlunduke
[8] oregon
[8] trail
[8] still
[8] being
[8] played
[8] online

Top Words in Tweet in G8:
[12] internetarchive
[9] sigcis
[9] comsoc
[9] carlo_cosmatos
[7] eulogy
[7] eudora
[7] mediahistorynow
[7] ieeeyp
[7] mediamorphis
[7] ieeehistory

Top Words in Tweet in G9:
[27] internetarchive
[25] arxiverssf
[17] per
[17] fer
[17] servir
[17] aquests
[17] aplicatius
[17] heu
[17] registrar
[12] aniol

Top Words in Tweet in G10:
[38] internetarchive
[30] looking
[30] research
[30] lesson
[30] wcaleb
[30] help
[30] automate
[30] downloading
[29] proghist
[3] textfiles

Top Word Pairs
Top Word Pairs in Tweet in Entire Graph:
[234] internet,archive
[191] archive,internetarchive
[190] 1923,1941
[190] streaming,internet
[177] wayback,machine
[160] free,download
[156] published,1923
[154] make,available
[150] download,streaming
[148] section,108h

Top Word Pairs in Tweet in G1:
[166] 1923,1941
[158] internet,archive
[140] make,available
[140] published,1923
[134] section,108h
[133] lets,libraries
[133] streaming,internet
[132] 108h,lets
[132] libraries,scan
[132] scan,make

Top Word Pairs in Tweet in G2:
[30] third,eye
[25] puerto,rico
[22] rico,coverage
[22] coverage,lagged
[22] lagged,behind
[22] behind,irma
[22] irma,harvey
[22] harvey,datadhrumil
[22] datadhrumil,tvnewsarchive
[22] tvnewsarchive,internetarchive

Top Word Pairs in Tweet in G3:
[44] wayback,machine
[20] recovered,213
[20] 213,missing
[20] missing,items
[20] items,blog
[20] blog,internetarchive
[20] internetarchive,using
[20] using,wayback
[20] machine,downloader
[20] downloader,hartator

Top Word Pairs in Tweet in G4:
[23] https,t
[23] t,co
[13] today's,archiving
[10] archiving,internetarchiveさんから
[10] 情報,アケマス
[10] アケマス,箱
[10] 箱,sp
[10] sp,dsコミュ情報サイト
[10] dsコミュ情報サイト,icomu
[10] icomu,m

Top Word Pairs in Tweet in G5:
[50] internetarchive,media
[50] media,really
[50] really,neglected
[50] neglected,puerto
[50] puerto,rico
[50] rico,uses
[50] uses,tv
[50] tv,archive
[50] archive,data
[50] data,analyze

Top Word Pairs in Tweet in G6:
[41] 65,015
[41] 015,issues
[41] issues,gazette
[41] gazette,india
[41] india,now
[41] now,up
[41] up,internetarchive
[41] internetarchive,running
[41] running,5
[41] 5,simultaneous

Top Word Pairs in Tweet in G7:
[13] textfiles,internetarchive
[8] oregon,trail
[8] trail,still
[8] still,being
[8] being,played
[8] played,online
[8] online,internetarchive
[8] internetarchive,4
[8] 4,seconds
[8] seconds,during

Top Word Pairs in Tweet in G8:
[9] sigcis,comsoc
[7] eulogy,eudora
[7] eudora,sigcis
[7] comsoc,mediahistorynow
[7] mediahistorynow,carlo_cosmatos
[7] carlo_cosmatos,ieeeyp
[7] ieeeyp,mediamorphis
[6] ieeehistory,eulogy
[6] mediamorphis,mediatwi
[5] librarycongress,internetarchive

Top Word Pairs in Tweet in G9:
[17] per,fer
[17] fer,servir
[17] servir,aquests
[17] aquests,aplicatius
[17] aplicatius,heu
[17] heu,registrar
[17] registrar,internetarchive
[16] arxiverssf,per
[7] arxiverssf,cupnacional
[7] cupnacional,omnium

Top Word Pairs in Tweet in G10:
[30] looking,research
[30] research,internetarchive
[30] internetarchive,lesson
[30] lesson,wcaleb
[30] wcaleb,help
[30] help,automate
[30] automate,downloading
[29] proghist,looking
[2] third,eye
[2] internetarchive,timestamps

Top Replied-To
Top Replied-To in Entire Graph:
@internetarchive
@olepennetier
@fsnjmg
@textfiles
@carlmalamud
@rebelliousval
@lukechilds
@diyclassics
@drherringchoker
@minuteravioli

Top Replied-To in G1:
@internetarchive
@fosscad
@zenlan
@therobgray
@bplboston
@subsublibrary
@bioinfocus
@psygnisfive
@jackaloshadows
@inkpuddle

Top Replied-To in G2:
@internetarchive
@alliomack
@careygillam
@pameladrew
@brewster_kahle
@demagogue69
@ealight461

Top Replied-To in G3:
@internetarchive
@edsu
@martijnkleppe
@brewbart
@markgraham
@valerie_schafer

Top Replied-To in G5:
@mcgrof
@pdp7

Top Replied-To in G6:
@carlmalamud
@internetarchive
@randomdabbler
@mani141210
@bbhorne
@aschrock

Top Replied-To in G7:
@textfiles
@drherringchoker
@minuteravioli
@creamywillbrie
@jaybird110127
@travisgoodspeed
@internetarchive
@cdisillusion
@worldnetdaily

Top Replied-To in G8:
@elizabethdebold
@cynicalgrrl

Top Replied-To in G9:
@arxiverssf
@casspf

Top Mentioned
Top Mentioned in Entire Graph:
@internetarchive
@fivethirtyeight
@carlmalamud
@tvnewsarchive
@srpimg
@markgraham
@textfiles
@wcaleb
@proghist

Top Mentioned in G1:
@internetarchive
@ridt
@mantzarlis
@vinckirala
@w758
@jessesheidlower
@subsublibrary
@tvnewsarchive
@drasticactionsa
@tnoisette

Top Mentioned in G2:
@internetarchive
@tvnewsarchive
@markgraham
@datadhrumil
@voxdotcom
@brewster_kahle
@tracey_pooh
@realdonaldtrump
@thewebbyawards
@propublica

Top Mentioned in G3:
@internetarchive
@adalerner
@simonw
@hartator
@edsu
@webrecorder_io
@netpreserve
@phonedude_mln
@markgraham
@ukwebarchive

Top Mentioned in G4:
@archerytrifours

Top Mentioned in G5:
@internetarchive
@fivethirtyeight
@aaronswartzday
@lisarein
@dcschelt
@xychelsea
@pdp7
@mcgrof
@carlmalamud
@voxdotcom

Top Mentioned in G6:
@internetarchive
@carlmalamud
@digitaldutta
@brewster_kahle
@nyaayain
@howardknopf
@fivethirtyeight
@voxdotcom
@howar

Top Mentioned in G7:
@internetarchive
@textfiles
@bryanlunduke
@mozilla
@realalexjones
@archiveteam
@robmanuel
@creamywillbrie
@firefox
@flurin

Top Mentioned in G8:
@internetarchive
@sigcis
@comsoc
@carlo_cosmatos
@mediahistorynow
@ieeeyp
@mediamorphis
@ieeehistory
@librarycongress
@mediatwi

Top Mentioned in G9:
@internetarchive
@arxiverssf
@aniol
@cupnacional
@omnium
@joantarda
@nriacarrerasfon
@arxiverslleida
@arxiu_lloret
@arxiutv3cr

Top Mentioned in G10:
@internetarchive
@wcaleb
@proghist
@textfiles
@fivethirtyeight
@voxdotcom

Top Tweeters
Top Tweeters in Entire Graph:
@ee
@robre62
@alizardx
@cloudwanderer3
@gen_ago
@katelaity
@fjfjf
@dominiquevanpee
@billm9
@infobae

Top Tweeters in G1:
@cloudwanderer3
@gen_ago
@katelaity
@dmvecinal
@eyegloarts
@flugennock
@ctdron
@emptywheel
@bifflawson
@runtodaylight

Top Tweeters in G2:
@billm9
@curran_marlene
@alliomack
@samhainnight
@pameladrew
@iamthedidi
@msnbc
@hanneshanath
@smp0312
@benosteen

Top Tweeters in G3:
@ee
@mistydemeo
@tabatkins
@brucel
@brewbart
@brendaneich
@aldopinga
@kjhank
@ade_oshineye
@jameschurchman

Top Tweeters in G4:
@red_global
@open_govern
@hesuko
@acmaswikibot
@tsukadasatoshi
@hrozvitnir
@gershbec
@abeshl25
@comfortface
@headshooter

Top Tweeters in G5:
@thetimepast
@lee4hmz
@gmcustodio
@kingmike33
@fly4sarah
@bubalub1021
@eaterofsoles
@fivethirtyeight
@kevinkresse
@lisarein

Top Tweeters in G6:
@shyduroff
@kewrious
@legalkant
@jackerhack
@bbhorne
@memeghnad
@carlmalamud
@sushantsinha
@ankushwithrg
@bombaywallah

Top Tweeters in G7:
@alasdairstuart
@chrisboese
@xionyc
@textfiles
@robmanuel
@worldnetdaily
@kamihack
@jmsl
@pamela_drouin
@realalexjones

Top Tweeters in G8:
@aol
@tedgrunewald
@jwomack
@google
@mediatwit
@mikemccaffrey
@jfagone
@mediamorphis
@mediahistorynow
@hammerdaily

Top Tweeters in G9:
@nuvol_com
@cupnacional
@aniol
@ouyuni
@francinanavarro
@adriancruzesp
@casspf
@dvdgmz
@joantarda
@stevenmaccall

Top Tweeters in G10:
@miriamkp
@hrabiakent
@historyanddigi
@sarahebond
@wcaleb
@omizorm
@pj_webster
@humpa_plymouth
@giovannidamiola
@fenrefstaff