Commit 3581ea17 authored by Aral Balkan

Add some experiments

parent ffcae14a
["digitaltrends.com","gizmodo.com","mashable.com","thenextweb.com","tech2.com","techcrunch.com","technorati.com","techradar.com","theverge.com","wired.com","ft.com","forbes.com","wsj.com","bbc.co.uk/news","channel4.com/news","dailystar.co.uk","express.co.uk","theguardian.com","independent.co.uk","mirror.co.uk","news.sky.com","thesun.co.uk","telegraph.co.uk","thetimes.co.uk","theatlantic.com","newsday.com","nypost.com","nytimes.com","chicago.suntimes.com","chicagotribune.com","denverpost.com","latimes.com","nydailynews.com","usatoday.com","washingtonpost.com","nationalgeographic.com","pressassociation.com","vogue.com","wmagazine.com","glamour.com","allure.com","self.com","style.com","teenvogue.com","gq.com","architecturaldigest.com","worldofinteriors.co.uk","brides.com","golfdigest.com","golfworlddigital.com","bonappetit.com","epicurious.com","cntraveler.com","arstechnica.com","backchannel.com","vanityfair.com","newyorker.com","tatler.com","pitchfork.com","economist.com","buzzfeed.com","space.com","olivemagazine.com","techrepublic.com","scientificamerican.com"]
\ No newline at end of file
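// Take the Alexa top 500 news CSV and convert it to a JSON array of domains
// (saved as domains-news.json).
const fs = require('fs')
const commaSeparated = fs.readFileSync('alexa-top-500-news.csv', 'utf-8')
// Split on LF or CRLF and skip blank lines (such as one left by a trailing newline).
const commaSeparatedList = commaSeparated.split(/\r?\n/).filter(line => line.trim() !== '')
// Strip everything up to and including the first comma, leaving only the domain.
const domainList = commaSeparatedList.map(line => line.replace(/.*?,/, '').trim())
const json = JSON.stringify(domainList)
fs.writeFileSync('domains-news.json', json)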
["reddit.com","Cnn.com","Nytimes.com","Huffingtonpost.com","Theguardian.com","Weather.com","news.google.com","News.yahoo.com","Foxnews.com","Forbes.com","Bbc.co.uk/news","Timesofindia.indiatimes.com","Shutterstock.com","Accuweather.com","Usatoday.com","Bloomberg.com","Wsj.com","Reuters.com","Money.cnn.com","Nbcnews.com","Cbsnews.com","Wunderground.com","Time.com","Economictimes.indiatimes.com","Drudgereport.com","Abcnews.go.com","Latimes.com","Indianexpress.com","Nypost.com","Chron.com","Cnbc.com","Thehindu.com","My.yahoo.com","Weather.gov","Sfgate.com","Theatlantic.com","Usnews.com","Eenadu.net","Topix.com","Nationalgeographic.com","Breitbart.com","Navbharattimes.indiatimes.com","Chicagotribune.com","Hindustantimes.com","Theguardian.com/world","Bankrate.com","Hollywoodreporter.com","Cbc.ca/news","Fortune.com","News.com.au","Manoramaonline.com","Yr.no","Economist.com","Smh.com.au","Alarabiya.net","Dnaindia.com","Dw.com","Yonhapnews.co.kr","Variety.com","Thehill.com","Newsnow.co.uk","Andhrajyothy.com","Ap.org","Owl.english.purdue.edu","Amarujala.com","Bostonglobe.com","Newsmax.com","Theonion.com","Nj.com","Aljazeera.com","News.sky.com","Business-standard.com","Anandabazar.com","Theglobeandmail.com","Rawstory.com","Examiner.com","Adweek.com","Irna.ir","Dallasnews.com","Bdnews24.com","Intellicast.com","Prnewswire.com","Livemint.com","Metafilter.com","Mathrubhumi.com","Philly.com","Seattletimes.com","Newsweek.com","Freep.com","Startribune.com","Thestar.com","Fark.com","Washingtontimes.com","Financialexpress.com","Theage.com.au","Euronews.com","Zougla.gr","Deccanchronicle.com","Azcentral.com","Suntimes.com","Voanews.com","Theweek.com","Ctvnews.ca","Al.com","Csmonitor.com","Denverpost.com","Ajc.com","Upi.com","Radar.weather.gov","Miamiherald.com","Stltoday.com","Lexisnexis.com","Mercurynews.com","Washingtonexaminer.com","Rd.com","Alternet.org","Observer.com","Metoffice.gov.uk","France24.com","Bbc.co.uk/news/business","Newser.com","Nationalpost.com","Heraldsun.com.au","Baltimoresun.com","Pbs.org/newshour","Foxbusiness.com","Theguardian.com/media","Thetimes.co.uk","Theaustralian.com.au","Wn.com","Mid-day.com","Cnet.com/videos","Detroitnews.com","Deseretnews.com","Theguardian.com/us","Jsonline.com","Dailytelegraph.com.au","Orlandosentinel.com","Tampabay.com","Sacbee.com","Seattlepi.com","Syracuse.com","Itv.com/news","Thehindubusinessline.com","Good.is","Sltrib.com","Dailythanthi.com","Newsday.com","Digitalspy.co.uk","Foreca.com","Ocregister.com","Theconversation.com","Indystar.com","Sun-sentinel.com","Mysanantonio.com","Wtop.com","Prweb.com","Bostonherald.com","Tennessean.com","Sandiegouniontribune.com","Newrepublic.com","Fcc.gov","Kansascity.com","Smartbrief.com","International.nytimes.com","Theepochtimes.com","Dinakaran.com","Theroot.com","Newsobserver.com","Newsvine.com","Sfchronicle.com","Telegraphindia.com","Afr.com","Theadvocate.com","Uk.reuters.com","Reviewjournal.com","Tribuneindia.com","Deccanherald.com","Rte.ie/news","Pewresearch.org","Arcamax.com","Newyorker.com/humor/borowitz-report","Thestranger.com","Usatoday.com/money","Richmond.com","C-span.org","Triblive.com","Dinamani.com","Pjmedia.com/instapundit","Courant.com","Npr.org/programs","Statesman.com","Vancouversun.com","Commondreams.org","Timesunion.com","Diversityinc.com","Mediamatters.org","Townhall.com/columnists","Losangeles.cbslocal.com","Weatherbug.com","Sandesh.com","Couriermail.com.au","Newsok.com","Globalpost.com","Omaha.com","Northjersey.com","Adn.com","Parade.com","Twincities.com","Dailyfinance.com","Pressdemocrat.com","Federalreserve.gov","Breakingnews.com","Palmbeachpost.com","Buffalonews.com","Citypages.com","Bangaloremirror.com","Siasat.com","Tbo.com","Adelaidenow.com.au","Uexpress.com","Staradvertiser.com","Desmoinesregister.com","Thedrum.com","Dailyherald.com","Newsmeback.com","Westword.com","Stripes.com","Journalstar.com","Mumbaimirror.com","Mcall.com","Star-telegram.com","Delawareonline.com","Pilotonline.com","Montrealgazette.com","Houstonpress.com","Lucianne.com","Mcclatchydc.com","Calgaryherald.com","Macleans.ca","Wickedlocal.com","Bbc.co.uk/worldservice","Telegram.com","Villagevoice.com","Lasvegassun.com","Factcheck.org","Lancasteronline.com","Redtram.com","Cision.com","Lohud.com","Independent.ie/business","Host.madison.com/wsj","Pe.com","Edmontonjournal.com","Militarytimes.com","Truthdig.com","Democratandchronicle.com","Providencejournal.com","Wfp.org","Mediapost.com","Dailynews.com","Thestatesman.com","Canberratimes.com.au","Roanoke.com","Rferl.org","Poynter.org","Jacksonville.com","Thedailymash.co.uk","Indymedia.org","Spiegel.de/international","Knoxnews.com","Greaterkashmir.com","Project-syndicate.org","Clarionledger.com","Wsj.com/news/opinion","Phoenixnewtimes.com","Thestate.com","Weather.aol.com","Ajitjalandhar.com","Dallasobserver.com","Tucson.com","Winnipegfreepress.com","Metronews.ca","Onlinenewspapers.com","Postandcourier.com","Corbisimages.com","Frontpagemag.com","Nwitimes.com","Nj.com/news","Heraldtribune.com","nyse.com","Pollen.com","Copyright.gov","Lehighvalleylive.com","Npr.org/sections/news","News-gazette.com","Abqjournal.com","Desertsun.com","Thenewstribune.com","Thecrimson.com","Arkansasonline.com","Snow-forecast.com","Ctpost.com","Thechronicleherald.ca","Gazette.com","Trove.nla.gov.au/newspaper","Ljworld.com","Niemanlab.org","Bizjournals.com/sanjose","Mediabistro.com","Commercialappeal.com","Theprovince.com","Afp.com","Unionleader.com","Journalnow.com","Bizjournals.com/boston","Spokesman.com","Bizjournals.com/sanfrancisco","Duluthnewstribune.com","Heraldnet.com","Floridatoday.com","Bizjournals.com/seattle","Greensboro.com","Heraldextra.com","Rgj.com","Bbc.co.uk/5live","Orlandoweekly.com","Einnews.com","Rollcall.com","News.harvard.edu/gazette","Dailycamera.com","Crainsnewyork.com","Dailypress.com","weather.yahoo.com","Onlineathens.com","Idahostatesman.com","Goodnewsnetwork.org","Rsf.org","Dailycal.org","Dailypioneer.com","Dowjones.com","Cjonline.com","Billoreilly.com","Dailyprogress.com","Fresnobee.com","Chicagoreader.com","Bizjournals.com/washington","Aol.com/news","Grandforksherald.com","Austinchronicle.com","Newswire.ca","Registerguard.com","Washingtoncitypaper.com","Merinews.com","Sfweekly.com","Wvgazettemail.com","Cbn.com/cbnnews","Sandiegoreader.com","Qctimes.com","Columbiatribune.com","Businesswireindia.com","Pressofatlanticcity.com","Burlingtonfreepress.com","Creators.com","Mlive.com/ann-arbor","Guardianlv.com","Metrotimes.com","Naplesnews.com","Globalissues.org","Opendemocracy.net","Nhregister.com","Pantagraph.com","Lansingstatejournal.com","Tallahassee.com","Argusleader.com","Newstimes.com","Macon.com","Yourhoustonnews.com","News-press.com","Therealnews.com","Chicagotribune.com/suburbs/daily-southtown","Columbian.com","Bizjournals.com/triangle","Magnumphotos.com","Nashvillescene.com","Briefing.com","Vcstar.com","Money.cnn.com/news","Governing.com","Jewishworldreview.com","Poughkeepsiejournal.com","Pjstar.com","Law.com","Tradingpost.com.au","Gainesville.com","Bizjournals.com/stlouis","Greenbaypressgazette.com","Durangoherald.com","Stanforddaily.com","Statesmanjournal.com","Assamtribune.com","Theledger.com","Asianage.com","Straight.com","Calgarysun.com","Citywire.co.uk","24-7pressrelease.com","Harpers.org","Chicagotribune.com/suburbs/post-tribune","Dailybreeze.com","Bizjournals.com/twincities","Timescolonist.com","Readingeagle.com","People-press.org","Bismarcktribune.com","Washingtonmonthly.com","Stamfordadvocate.com","Greenvilleonline.com","Multichannel.com","Ibj.com","Savannahnow.com","Journalgazette.net","Sunshinecoastdaily.com.au","Capitalgazette.com","Lacrossetribune.com","Fredericksburg.com","Steynonline.com","Themercury.com.au","Postbulletin.com","Theday.com","Santacruzsentinel.com","Lubbockonline.com","Tcpalm.com","Ireport.cnn.com","Sanluisobispo.com","Economist.com/topics","Sabanews.net","Theoaklandpress.com","Postindependent.com","Newsminer.com","Wacotrib.com","News-leader.com","Theherald.com.au","Starnewsonline.com","Bellinghamherald.com","Huffingtonpost.com/weird-news","Cjr.org","Kitsapsun.com","Sourcewatch.org","Courierpostonline.com","Onthesnow.com","Santafenewmexican.com","Sj-r.com","Sacurrent.com","Disinfo.com","Ocala.com","Napavalleyregister.com","Clutchmagonline.com","Adelaidenow.com.au/news/south-australia","Courierpress.com","News-journalonline.com","Dailybruin.com","Jconline.com","Newseum.org","Bnd.com","Nwfdailynews.com","Fredericknewspost.com","Standard.net","Amarillo.com","Dcourier.com","Lowellsun.com","Thetelegram.com","Theolympian.com","Pnj.com","Gannett.com","Bendbulletin.com","Rep-am.com","Southbendtribune.com","Spiked-online.com","Yaledailynews.com","Bizjournals.com/tampabay","Trib.com","Chronicle.augusta.com","Rapidcityjournal.com","Bizjournals.com/orlando","Thedp.com","Broadcastingcable.com","Thestarpress.com","Newsroom.ucla.edu","Chnm.gmu.edu","Wcfcourier.com","Theeagle.com","Journaltimes.com","Ubm.com","Andhrabhoomi.net","Redding.com","Weatherforyou.com","Thetimes-tribune.com","Northcountrynow.com","Abcnews.go.com/Video"]
\ No newline at end of file