TestingCatalog is reporting on the latest features of trending apps in AI, web3 and web2

ChatGPT is testing allowlist and blocklist for news websites (full domains dump)

Discover ChatGPT's whitelisted and blacklisted news domains for tailored updates

· 6 min read
ChatGPT is testing allowlist and blocklist for news websites (full domains dump)

A new experiment has been spotted in the code of the latest web release showcasing a list of different domains divided into categories. There are 5 of them in total and only one seems to be related to the allow list while others are rather related to filtering and blocking.

Quivering recent information from these websites is already working in a way that with some domains it works while with others it doesn't. It is not fully clear if this list is experimental or if it is overriding another internal instruction but it makes one point visible - not all domains will be treated equally by OpenAI.

To get a taste of how ChatGPT behaves, you can try asking these questions:

  1. A question about the domain in the allow list: what is the latest news from 15minutenews.com?
  2. A question about the domain in the block list: what about reuters.com?

The first one will be denied and the second will be answered with a list of the latest news.

0:00
/1:02

How ChatGPT works depending on the queried domain

A full list of domains can be found below:

allowed_news_domains

"wikipedia.org",
"reuters.com",
"aljazeera.com",
"politico.com",
"foxnews.com",
"foxsports.com",
"bleacherreport.com",
"sportingnews.com",
"foxsports.com.au",
"indiatoday.in",
"zeenews.india.com"

hard_filtered_domains

"15minutenews.com",
"abc.net.au",
"abc7.com",
"abcnews.com",
"abcnews.go.com",
"apnews.com",
"baidu.com",
"bbc.co.uk",
"bbc.com",
"arstechnica.com",
"axios.com",
"billboard.com",
"bostonglobe.com",
"bloomberg.com",
"businessinsider.com",
"cbsnews.com",
"cbssports.com",
"cnbc.com",
"cnet.com",
"cnn.com",
"eastbaytimes.com",
"eonline.com",
"fivethirtyeight.com",
"forbes.com",
"fortune.com",
"freep.com",
"glamour.com",
"headtopics.com",
"hollywoodreporter.com",
"indianexpress.com",
"inews.co.uk",
"kansascity.com",
"kmbc.com",
"latimes.com",
"marketwatch.com",
"mercurynews.com",
"msn.com",
"msnbc.com",
"nbcbayarea.com",
"nbcnews.com",
"nbclosangeles.com",
"ndtv.com",
"newsday.com",
"news.google.com",
"newspub.live",
"newyorker.com",
"npr.com",
"npr.org",
"nytimes.com",
"people.com",
"popsugar.com",
"realclearpolitics.com",
"reddit.com",
"sacbee.com",
"sfgate.com",
"si.com",
"stylecaster.com",
"sohu.com",
"theathletic.com",
"theatlantic.com",
"theglobeandmail.com",
"theguardian.com",
"thehindu.com",
"theverge.com",
"time.com",
"timesofindia.com",
"townandcountrymag.com",
"usatoday.com",
"usmagazine.com",
"usnews.com",
"variety.com",
"vox.com",
"washingtonpost.com",
"wsj.com",
"accuweather.com",
"weather.com",
"zhihu.com"

blocked_news_domains

"15minutenews.com",
"abc.net.au",
"abc7.com",
"abcnews.com",
"abcnews.go.com",
"apnews.com",
"baidu.com",
"baike.baidu.com",
"bankrate.com",
"bbc.co.uk",
"bbc.com",
"arstechnica.com",
"axios.com",
"billboard.com",
"bostonglobe.com",
"bloomberg.com",
"blooomberg.com",
"businessinsider.com",
"cbsnews.com",
"cbssports.com",
"cnbc.com",
"cnet.com",
"cnn.com",
"cntraveller.com",
"cosmopolitan.com",
"eastbaytimes.com",
"eonline.com",
"facebook.com",
"fivethirtyeight.com",
"forbes.com",
"fortune.com",
"freep.com",
"glamour.com",
"google.com",
"headtopics.com",
"hollywoodreporter.com",
"indianexpress.com",
"inews.co.uk",
"kansascity.com",
"kmbc.com",
"latimes.com",
"linkedin.com",
"lyricstranslate.com",
"marketwatch.com",
"medium.com",
"mercurynews.com",
"msn.com",
"msnbc.com",
"nbcbayarea.com",
"nbcnews.com",
"nbclosangeles.com",
"ndtv.com",
"newsday.com",
"news.google.com",
"newspub.live",
"newyorker.com",
"npr.com",
"npr.org",
"nytimes.com",
"pbs.org",
"people.com",
"plumandbirch.com",
"popsugar.com",
"realclearpolitics.com",
"reddit.com",
"rollingstone.com",
"sacbee.com",
"sfgate.com",
"si.com",
"spiceworks.com",
"stackoverflow.com",
"stylecaster.com",
"sohu.com",
"tandfonline.com",
"techradar.com",
"theathletic.com",
"theatlantic.com",
"theglobeandmail.com",
"theguardian.com",
"thehindu.com",
"theverge.com",
"textile-future.com",
"time.com",
"timesofindia.com",
"tomsguide.com",
"townandcountrymag.com",
"usatoday.com",
"usmagazine.com",
"usnews.com",
"variety.com",
"vox.com",
"washingtonpost.com",
"wsj.com",
"accuweather.com",
"weather.com",
"zhihu.com",
"zhuanlan.zhihu.com"

unconsenting_pii_denylist

"411.com",
"acxiom.com",
"advanced-people-search.com",
"advancedbackgroundchecks.com",
"backgroundalert.com",
"beenverified.com",
"bizapedia.com",
"checkpeople.com",
"classmates.com",
"clustrmaps.com",
"cocofinder.com",
"cyberbackgroundchecks.com",
"factfind.com",
"familytreenow.com",
"fastpeoplesearch.com",
"findpeoplefast.net",
"freepeopledirectory.com",
"homemetry.com",
"idtrue.com",
"infomart-usa.com",
"infotracer.com",
"instantcheckmate.com",
"intelius.com",
"mylife.com",
"neighbor.report",
"nuwber.com",
"officialusa.com",
"okcaller.com",
"openphone.com",
"ourpublicrecords.org",
"peekyou.com",
"peoplebyname.com",
"peoplefinders.com",
"peoplelooker.com",
"peoplesearchnow.com",
"peoplesearchsite.com",
"peoplesmart.com",
"peoplewhiz.com",
"persopo.com",
"pipl.com",
"privateeye.com",
"privaterecords.com",
"publicrecords.report",
"publicrecords.site",
"publicrecordsnow.com",
"publicrecordsofficial.com",
"publicseek.com",
"radaris.com",
"radaris.com",
"rehold.com",
"rocketreach.co",
"searchbug.com",
"searchpeoplefree.com",
"searchquarry.com",
"smartbackgroundchecks.com",
"socialcatfish.com",
"spoke.com",
"spokeo.com",
"spyfly.com",
"thatsthem.com",
"truepeoplesearch.com",
"truthfinder.com",
"unicourt.com",
"unitedstatesphonebook.com",
"unmask.com",
"usa-people-search.com",
"usphonebook.com",
"ussearch.com",
"verecor.com",
"voterrecords.com",
"whitepages.com",
"xlek.com",
"zabasearch.com",
"zoominfo.com"

relaunch_denylist

"1lib.us",
"3lib.net",
"Ebook3000.com",
"Ebookee.com",
"allrecipes.com",
"angi.com",
"angieslist.com",
"annas-archive.org",
"ask.com",
"b-ok.cc",
"bhg.com",
"brides.com",
"byrdie.com",
"care.com",
"cookinglight.com",
"craftjack.com",
"dailybeast.com",
"eatingwell.com",
"ebook-hunter.org",
"ebookbb.com",
"ebookelo.com",
"economist.com",
"eiu.com",
"ereads.net",
"ew.com",
"flibusta.site",
"foodandwine.com",
"freefullpdf.com",
"freetechbooks.com",
"graycity.net",
"handy.com",
"health.com",
"homeadvisor.com",
"homestars.com",
"iac.com",
"ikindlebooks.com",
"instapro.it",
"instyle.com",
"investopedia.com",
"libgen.fun",
"libgen.rs",
"lifewire.com",
"liquor.com",
"magnolia.com",
"mosaic.co",
"my-hammer.de",
"myanonamouse.net",
"mybuilder.com",
"mydomaine.com",
"mywarez.org",
"oceanofpdf.com",
"parents.com",
"pdfdrive.com",
"pdfget.com",
"peopleenespanol.com",
"realsimple.com",
"sanet.st",
"sci-hub.tw",
"seriouseats.com",
"shape.com",
"simplyrecipes.com",
"singlelogin.me",
"southernliving.com",
"southernliving.com",
"the-eye.eu",
"thebalancemoney.com",
"theguardian.com",
"thespruce.com",
"tokybook.com",
"trantor.is",
"travaux.com",
"travelandleisure.com",
"treehugger.com",
"tripsavvy.com",
"verywellhealth.com",
"verywellhealth.com/",
"vivian.com",
"werkspot.nl/",
"yudhacookbook.my.id",
"z-lib.org",
"6abc.com",
"abc.com",
"abc13.com",
"abc7chicago.com",
"abc7news.com",
"abc7ny.com",
"academia.stackexchange.com",
"acsess.onlinelibrary.wiley.com",
"acsjournals.onlinelibrary.wiley.com",
"actu.fr",
"agupubs.onlinelibrary.wiley.com",
"ai.plainenglish.io",
"ai.stackexchange.com",
"aibusiness.com",
"aimultiple.com",
"ajp.psychiatryonline.org",
"ajph.aphapublications.org",
"alphahistory.com",
"alz-journals.onlinelibrary.wiley.com",
"americanaddictioncenters.org",
"analisa.io",
"analyticalsciencejournals.onlinelibrary.wiley.com",
"android.stackexchange.com",
"apnews.com",
"app.noteable.io",
"apple.stackexchange.com",
"ar5iv.labs.ar5iv.org",
"arc.aiaa.org",
"architizer.com",
"archiveofourown.org",
"artfasad.com",
"ascelibrary.org",
"ascopubs.org",
"asia.nikkei.com",
"askubuntu.com",
"attackofthefanboy.com",
"au.pcmag.com",
"au.trustpilot.com",
"autoesporte.globo.com",
"aviation.stackexchange.com",
"aws.plainenglish.io",
"baike.baidu.com",
"baike.so.com",
"baike.sogou.com",
"balkaninsight.com",
"baomoi.com",
"beinsure.com",
"bera-journals.onlinelibrary.wiley.com",
"besjournals.onlinelibrary.wiley.com",
"bettermarketing.pub",
"betterprogramming.pub",
"bitcoin.stackexchange.com",
"biz.chosun.com",
"blender.stackexchange.com",
"blenderartists.org",
"blog.bitsrc.io",
"blog.devgenius.io",
"blog.gopenai.com",
"blog.landr.com",
"blog.stackademic.com",
"bloggingwizard.com",
"blogs.scientificamerican.com",
"boardgamegeek.com",
"bobbyhadz.com",
"bookanalysis.com",
"bookriot.com",
"books.google.com",
"bootcamp.uxdesign.cc",
"bpspsychub.onlinelibrary.wiley.com",
"buffalonews.com",
"business.nikkei.com",
"businessmodelanalyst.com",
"buy.stripe.com",
"bvmsports.com",
"ca.trustpilot.com",
"ca.vlex.com",
"canliiconnects.org",
"case-law.vlex.com",
"chat.openai.com",
"chemistry-europe.onlinelibrary.wiley.com",
"chicago.eater.com",
"chicago.suntimes.com",
"chinawto.mofcom.gov.cn",
"cir.nii.ac.jp",
"classic.austlii.edu.au",
"codereview.stackexchange.com",
"collider.com",
"colorlib.com",
"communityimpact.com",
"comparisons.financesonline.com",
"compass.onlinelibrary.wiley.com",
"context.reverso.net",
"cookpad.com",
"countryeconomy.com",
"cpad.io",
"cpc.people.com.cn",
"cryptorank.io",
"cs.stackexchange.com",
"culturedvultures.com",
"cybernews.com",
"dangjian.people.com.cn",
"dangshi.people.com.cn",
"data-lead.com",
"data.iimedia.cn",
"database.earth",
"datascience.stackexchange.com",
"datasheets.globalspec.com",
"datosmacro.expansion.com",
"dba.stackexchange.com",
"de.trustpilot.com",
"deadline.com",
"dealspotr.com",
"decider.com",
"devcodef1.com",
"dialnet.unirioja.es",
"dict.leo.org",
"dictionary.cambridge.org",
"dictionary.reverso.net",
"digital-strategy.ec.europa.eu",
"diplomeo.com",
"dir.indiamart.com",
"discussions.unity.com",
"disney.fandom.com",
"district.ce.cn",
"dknetwork.draftkings.com",
"dl.acm.org",
"dlnext.acm.org",
"docs.blender.org",
"doctor.webmd.com",
"dodropshipping.com",
"dof.gob.mx",
"dotesports.com",
"downdetector.com",
"download.macrotrends.net",
"eandt.theiet.org",
"earlygame.com",
"economia.uol.com.br",
"economictimes.indiatimes.com",
"econtent.hogrefe.com",
"edu.people.com.cn",
"efsa.onlinelibrary.wiley.com",
"ehp.niehs.nih.gov",
"eiga.com",
"elearningindustry.com",
"electronics.stackexchange.com",
"elibrary.worldbank.org",
"emedicine.medscape.com",
"emeritus.org",
"en.namu.wiki",
"en.yna.co.kr",
"energycentral.com",
"english.scio.gov.cn",
"english.stackexchange.com",
"ent.people.com.cn",
"epocanegocios.globo.com",
"epubs.siam.org",
"es.scribd.com",
"esajournals.onlinelibrary.wiley.com",
"ethereum.stackexchange.com",
"europepmc.org",
"experts.illinois.edu",
"experts.umn.edu",
"export.ar5iv.org",
"facebook.com",
"faq.usps.com",
"faroutmagazine.co.uk",
"fastercapital.com",
"fbref.com",
"federalnewsnetwork.com",
"fernfortuniversity.com",
"finance.ce.cn",
"finance.people.com.cn",
"financesonline.com",
"firmeneintrag.creditreform.de",
"firstsiteguide.com",
"flixpatrol.com",
"focus.psychiatryonline.org",
"footwearnews.com",
"foreignpolicy.com",
"forgottenrealms.fandom.com",
"fourminutebooks.com",
"foursquare.com",
"fr.trustpilot.com",
"game8.co",
"game8.jp",
"gamedev.stackexchange.com",
"gamerant.com",
"gameriv.com",
"gaming.stackexchange.com",
"gatherer.wizards.com"

Insight on ChatGPT

ChatGPT, constantly evolving under OpenAI's wing, is breaking grounds in conversational AI. By understanding and generating human-like text, it is ideal for tasks such as translation, answering questions, and now, tailored news provision. This feature is a testament to the AI's growing utility and platform reliability, setting a new benchmark for AI interaction with real-world information.

Sources

The information about the new allowlist and blacklist for ChatGPT query capabilities towards different news websites was obtained through reverse-engineering efforts of the TestingCatalog team. The data interpreted from the source files was not published through an official announcement, hence reflecting the findings as of the most recent analysis of the application's codebase.

At TestingCatalog, the authenticity and credibility of our insights remain paramount. Our editor painstakingly verifies each feature and its underlying mechanisms personally to ensure our readers receive accurate and trustworthy reports on the newest tech trends.