Wikipedia:Link rot/URL change requests
This page is for requesting modifications to URLs, such as marking dead or changing to a new domain. Some bots are designed to fix link rot; they can be notified here. These bots include InternetArchiveBot and WaybackMedic. This page can be monitored by bot operators from other language wikis since URL changes are universally applicable.
US agencies
This section is pinned and will not be automatically archived.
90+ agencies identified as having web pages deleted during the Trump admin: https://asia.nikkei.com/static/vdata/infographics/deleted-website/
ω Awaiting further developments and time to go through them -- GreenC 16:46, 1 April 2025 (UTC)
- White House
- Department of Health and Human Services
- Department of Agriculture
- USAID
- National Park Service
- worker.gov
- Department of Labor
- U.S. Agency for Global Media - usagm.gov
- Federal Mediation and Conciliation Service (United States) - fmcs.gov
- Woodrow Wilson International Center for Scholars - wilsoncenter.org
- Institute of Museum and Library Services - imls.gov
- Community Development Financial Institutions Fund - cdfifund.gov
- Minority Business Development Agency - mbda.gov
- Department of Transportation - dot.gov - includes 11 agencies: FAA, FHWA, FMCSA, FRA, FTA, GLS, MARAD, NHTSA, OIG, OST, PHMSA
- Environmental Protection Agency - epa.gov
- Department of Housing and Urban Development - hud.gov
- Centers for Disease Control and Prevention
- Federal Emergency Management Agency
- National Institutes of Health
- General Services Administration
- Department of Homeland Security
- Department of Commerce
- employer.gov
- Office of the Assistant Secretary for Health
- sftool.gov
- Department of Energy
- Department of the Interior
- Department of Education
- NOAA
- Substance Abuse and Mental Health Services Administration
- climate.gov
- Department of Defense
- Health Resources & Services Administration
- AbilityOne Commission
- Department of State
- United States Patent and Trademark Office
- BOEM
- The Census Bureau
- CISA
- HUD User
- MILLENNIUM CHALLENGE CORPORATION
- performance.gov
- National Archives and Records Administration
- Bureau of Safety and Environmental Enforcement
- Federal Aviation Administration
- Food and Drug Administration
- House of Representatives
- Department of Justice
- National Endowment for the Humanities
- Department of the Treasury
- youth.gov
- American Climate Corps
- Federal Trade Commission
- Global Change Research Program
- NASA
- Administration for Community Living
- National Endowment for the Arts
- ATF
- Bureau of Indian Affairs
- Customs and Border Protection
- Consumer Financial Protection Bureau
- Consumer Product Safety Commission
- Office of the Director of National Intelligence
- Economic Development Administration
- Equal Employment Opportunity Commission
- Export-Import Bank of the United States
- FBI
- Federal Committee on Statistical Methodology
- Federal Housing Finance Agency
- geoplatform.gov
- Assistant Secretary for Technology Policy
- IRS
- National Labor Relations Board
- Office of Personnel Management
- Department of Veterans Affairs
- American Battle Monuments Commission
- Agency for Healthcare Research and Quality
- americorps.gov
- Advanced Research Projects Agency for Health
- Bonneville Power Administration
- cms.gov
- congress.gov
- digital.gov
- ENERGY STAR
- ej.gov
- farmers.gov
- medicalcountermeasures.gov
- peacecorps.gov
- Securities and Exchange Commission
- Social Security Administration
- stopbullying.gov
- Citizenship and Immigration Services
- United States Interagency Council on Homelessness
- workcenter.gov
- Air Force
- Army
- Navy
- Marine Corps
thestage.co.uk
It was reported here that the listed domain underwent restructuring and now all links 404, even though the pages still exist at another URL. Listing it here in lieu of them. There are 3,081 uses of this source in citations. cyberdog958Talk 22:33, 28 May 2025 (UTC)
- User:Cyberdog958 thank you for the referral. This is no problem. According to the IP editor: "we failed to update when we migrated platform". There is not much that can be done to make the links live, but they can be converted to archive links so they are still readable eg. [1] If there is an existing map of old to new the links can be changed easily. -- GreenC 04:45, 29 May 2025 (UTC)
- Hi GreenC
- Thanks for your update.
- We can create a map, which would of course take some time as each article linked to needs to be found on the current site and added to the map.
- Once we have done this, to whom and how should it be submitted please? And in what format? Would csv suffice? Does the Wiki URL / page on which the broken link is found need to be listed, or will the broken link be sufficient? 149.86.35.195 (talk) 07:04, 29 May 2025 (UTC)
- Great! Here is the list of URLs: Wikipedia:Link rot/Cases/thestage.co.uk .. After the map is created, I will make the changes onwiki via bot. Some URLs might have commas, so tab-separated or space-separated is easiest. When you are ready, post the map to the /Cases/ page, overwriting the current content, then notify me.
- There is a limitation you should know about. The list of URLs is for English wiki and every other wiki (300+). However, at this time, I can only update English wikipedia .. not the other wikis. In the future I will be able to update the other wikis, but when that will happen is unknown, it could be a long time. Nevertheless I keep a log of potential changes, when that feature becomes available I'll push them through. Most of the links are on English wiki anyway. -- GreenC 17:29, 29 May 2025 (UTC)
- Hi GreenC
- I hope all is well in your world!
- Before investing in creating a map for so many links, it seems prudent to check that this works as intended.
- I have a test file of just 7 links, and forgive me if I am being a bit dense here, I can't see how I can upload it or get it to you. I have visited Wikipedia:Link rot/Cases/thestage.co.uk but don't see how to submit these test updates.
- Please advise
- With thanks
- Simon StageSimon (talk) 12:37, 3 June 2025 (UTC)
- StageSimon, all is well thank you. I'm not sure what interface to Wikipedia you are using, but, there should be a button somewhere that says "Edit this page" or "Edit source". Then you see the first line says:
- Create a map like this:
- And that's "it" (many times). Recommend copy-paste the entire list to your local computer and create the map there with your local tools, and when ready copy-paste the entire list back into this page using the Edit button again. Hope that helps any questions let me know. -- GreenC 18:01, 3 June 2025 (UTC)
- Hi
- I am sorry but I still don't understand the mapping format. Your "Create a map like this:" above doesn't seem to explain what should be pasted back in (other than the new target links)
- Are you saying that the order of the URLs in the link rot page is tied to the articles, so it's the order of the links that relates back to the wiki entry? I don't otherwise see how it can be discerned.
- As an example, I want
- https://www.thestage.co.uk/features/2017/edinburgh-adelaide-fringe-holden-street-theatres-award
- to now go to
- https://www.thestage.co.uk/features/from-edinburgh-to-adelaide-the-prize-taking-shows-down-under
- Is it just a case that where it appears in the list will be enough to tie it back to the wiki page?
- Thanks for your patience with me!
- Simon 62.232.21.50 (talk) 13:05, 4 June 2025 (UTC)
- Each line of the map looks like this:
<old URL>[SPACE]<new URL>
It maps the old URL to the new URL.
For example: https://oldsite.com https://newsite.com
That is all. Don't worry about how lines are ordered or what page the link was found, the bot will take care of it. — GreenC 17:09, 4 June 2025 (UTC)
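For anyone preparing such a map locally, the format described above (one "old URL, whitespace, new URL" pair per line) can be sanity-checked before posting. The following is only a minimal sketch, not the bot's own code; the function name `parse_url_map` is hypothetical, and it assumes URLs contain no unencoded spaces.

```python
def parse_url_map(text: str) -> dict[str, str]:
    """Parse a map with one '<old URL> <new URL>' pair per line.

    Hypothetical helper for checking a map before it is posted; it is
    not the bot's parser. Blank lines are skipped, and any line that does
    not contain exactly two whitespace-separated fields raises ValueError.
    """
    mapping: dict[str, str] = {}
    for lineno, raw in enumerate(text.splitlines(), start=1):
        line = raw.strip()
        if not line:
            continue
        fields = line.split()  # splits on spaces or tabs
        if len(fields) != 2:
            raise ValueError(f"line {lineno}: expected 2 fields, got {len(fields)}")
        old_url, new_url = fields
        mapping[old_url] = new_url
    return mapping


if __name__ == "__main__":
    sample = (
        "https://www.thestage.co.uk/features/2017/edinburgh-adelaide-fringe-holden-street-theatres-award "
        "https://www.thestage.co.uk/features/from-edinburgh-to-adelaide-the-prize-taking-shows-down-under\n"
    )
    for old, new in parse_url_map(sample).items():
        print(old, "->", new)
```

Line order does not matter in this sketch, consistent with the note above that the bot works out which page each link belongs to.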
ω Awaiting map -- GreenC 19:42, 7 June 2025 (UTC)
- Hi GreenC
- The map is not yet built - it will take quite some time as there's no pattern - so each new url will need to be manually found.
- Can I test this by just uploading 7 test URLs - if so can I just add those, or do I need to include them in the whole list and re-paste that? Clearly I do not want to disturb all the other links, be they correct or incorrect.
- You've been so helpful, I really appreciate it.
- Simon 149.86.35.195 (talk) 09:31, 8 June 2025 (UTC)
- Take your time, this kind of work can take a long time, particularly if it's all manual! There is always Amazon Mechanical Turk. You can change that page any way you like. -- GreenC 20:32, 9 June 2025 (UTC)
- Hi
- I uploaded 7 test URL updates this morning (say 7 hours ago) - none have updated yet.
- Either the bot is very busy or I did it wrong.
- Appreciate you taking a look when you have chance.
- Thank you
- Simonm StageSimon (talk) 16:08, 10 June 2025 (UTC)
- StageSimon, once you have uploaded all the links I will process them with the bot. I do it in batches; it's semi-automated. I will download the links from that page to my local computer into a file, and the bot will run that file as the input. -- GreenC 19:23, 10 June 2025 (UTC)
- Hi GreenC
- I hope all is good with you.
- Just uploaded 894 link corrections.
- Could you please process with the bot.
- Very much appreciated.
- Simon StageSimon (talk) 15:56, 7 July 2025 (UTC)
- Hi GreenC
- I wonder if you are away on holiday as atypically you've not responded .
- Best
- Simon 62.232.21.50 (talk) 12:25, 16 July 2025 (UTC)
- Hi sorry I'll start on this after I complete Wikipedia:Link rot/URL change requests#nih.gov .. each project requires retooling the program, and uses a lot of compute resources, so I do them one at a time. Thanks for the reminder. - — GreenC 14:39, 16 July 2025 (UTC)
- Hey
- Please don't apologise, you are the one helping me.
- Thanks and no doubt you will ping me once you've got to it.
- All the best
- Simon StageSimon (talk) 16:05, 16 July 2025 (UTC)
- Hi StageSimon: stats below. Click the first link to see the diffs. Any problems let me know -- GreenC 04:05, 21 July 2025 (UTC)
- Hi GreenC
- Thanks for your efforts and the update.
- Apologies but I am a little confused. I updated 894 links that now point to the correct page on the live site. The intention was then to supply you with the next batch once they were ready, so only those 894 links should have required a change. Sorry I don't think I was too clear about that when I submitted them.
- Have other changes been made too? (e.g. a lot of Wayback Medic links, e.g. "Wayback Medic 2.5 per WP:URLREQ#thestage.co.uk")
- If so, what, and does that preclude further updates? Is it possible to re-run the linkrot analysis again, so we get a fresh data set of links that 404 please?
- Visiting https://sigma.toolforge.org/summary.py I get an internal server error when trying to reach page 2, FYI.
- Most grateful for the time you put into this. Thank you and apologies if I am being a bit slow here!
- S StageSimon (talk) 13:14, 21 July 2025 (UTC)
StageSimon, oh it wasn't clear this was a batch upload. In any case, after you upload another set the bot will convert a currently archived URL to the new live link, so no problem. The sigma.toolforge tool (not mine) is kind of flaky; nothing I can do about it, try again later. -- GreenC 19:46, 21 July 2025 (UTC)
Enwiki
- Checked 3,117 pages and edited 2,211 pages. Moved 1,012 links to a new URL: 64 normal redirects, 948 ruled mapped redirects. Resolved 126 soft-404s. Removed 3 {{dead link}}. Added 115 {{dead link}}. Switched 90 |url-status=dead to live. Switched 118 |url-status=live to dead. Added 1,627 archive URLs (1,308 Wayback).
www.konami-data.com
While editing List of Nintendo DS Wi-Fi Connection games, I found that the URL was usurped, and redirects to a fake captcha malware now. KamiraMV (talk) 03:28, 4 June 2025 (UTC)
Done per WP:JUDI batch #28. GreenC 21:53, 5 September 2025 (UTC)
adventuregamers.com
Adventure Gamers has been usurped by its new owners to focus on gambling promotion per discussion at the video games project. Impact seems small with 33 pages. Sariel Xilo (talk) 15:18, 15 June 2025 (UTC)
- It's adventuregamerS.com, with a plural S at the end.--LaukkuTheGreit (Talk•Contribs) 15:26, 15 June 2025 (UTC)
- My bad! adventuregamers.com has more pages, over 800. Sariel Xilo (talk) 15:32, 15 June 2025 (UTC)
Done per WP:JUDI batch #28. GreenC 21:53, 5 September 2025 (UTC)
aje.io
@GreenC, Al Jazeera now uses aje.io domain as their URL redirection site for their articles. As they are shortlinks, we would not be able to track or trace back to the original webpage if they choose to sunset the service. As the use is widespread, and to preemptively prevent link rot, can you help with the expansion of the URLs? I will have this domain added to Special:BlockedExternalDomains as well (when would be a better time, now or after the URLs have been expanded?) – robertsky (talk) 15:44, 24 June 2025 (UTC)
- 206 pages.
- – robertsky, this is a good idea. The bot will automatically expand the URLs as they are normal 301 redirects. It probably won't matter when they are blocked because the bot is not saving the short form. If you want to wait out of caution that's fine or if you want to block now and something breaks, I can let you know and we can unblock temporarily (if not disruptive to some other process). This job is in the queue it might be a week or so before I finish the requests preceding. But if you want it faster can do, 200 pages won't take long. -- GreenC 16:06, 24 June 2025 (UTC)
- @GreenC I have added the block list. This would cut down the amount of work or runs the bot has to do after the initial run. – robertsky (talk) 14:18, 25 June 2025 (UTC)
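As an illustration of what "expanding" a shortlink through normal 301 redirects involves, here is a minimal sketch. It is not the bot's implementation: the function `expand_short_url` and the `head` callable are hypothetical, the `aje.io` path in the usage example is made up, and the HTTP lookup is injected so the logic can be tried without network access.

```python
from typing import Callable, Optional, Tuple
from urllib.parse import urljoin

# Hypothetical signature: head(url) returns (status_code, Location header or None).
HeadFn = Callable[[str], Tuple[int, Optional[str]]]

REDIRECT_CODES = {301, 302, 307, 308}


def expand_short_url(url: str, head: HeadFn, max_hops: int = 5) -> Optional[str]:
    """Follow redirects from a shortlink and return the final live URL.

    Returns None if the chain loops, exceeds max_hops redirects, or ends
    on anything other than HTTP 200.
    """
    seen = set()
    for _ in range(max_hops + 1):
        if url in seen:  # redirect loop
            return None
        seen.add(url)
        status, location = head(url)
        if status in REDIRECT_CODES and location:
            url = urljoin(url, location)  # Location may be relative
            continue
        return url if status == 200 else None
    return None


if __name__ == "__main__":
    # Fake responses standing in for real HTTP requests (made-up URLs).
    responses = {
        "https://aje.io/example": (301, "https://www.aljazeera.com/news/example-article"),
        "https://www.aljazeera.com/news/example-article": (200, None),
    }
    print(expand_short_url("https://aje.io/example", lambda u: responses.get(u, (404, None))))
```

Storing the expanded URL rather than the short form is what keeps a citation traceable if the shortener is ever shut down.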
Enwiki
- Checked 206 pages and edited 197 pages. Moved 2,030 links to a new URL: 2,028 normal redirects, 2 ruled mapped redirects. Resolved 1 soft-404s. Removed 1 {{dead link}}. Added 6 {{dead link}}. Switched 39 |url-status=dead to live. Added 1 archive URLs (1 Wayback).
- Very high density @ Timeline of the Gaza war (7 May 2024 – 12 July 2024) (401) and Timeline of the Gaza war (13 July 2024 – 26 September 2024) (360) discussed Talk:Timeline_of_the_Gaza_war_(7_May_2024_–_12_July_2024)#Al-Jazeera_?
Done -- GreenC 20:07, 30 June 2025 (UTC)
- Many were missed due to {{#invoke:Cite, which editors are using to hack around the limitation on number of templates in an article. My bot does not support invoked cite templates. Wrote a new script for this: Special:Diff/1298149769/1298518263 -- GreenC 01:15, 3 July 2025 (UTC)
- Thanks. I noticed that the aje.io got archived as 301 as well. When the dust is settled, I will scrub and update the archive links. – robertsky (talk) 01:56, 3 July 2025 (UTC)
- The bot should have done this. I didn't configure it correctly during setup. Fixed. -- GreenC 17:45, 6 July 2025 (UTC)
- thanks! – robertsky (talk) 17:55, 6 July 2025 (UTC)
epa.gov
Environmental Protection Agency
7,744 pages -- GreenC 14:48, 1 July 2025 (UTC)
Enwiki
- Checked 7,752 pages and edited 3,859 pages. Moved 2,306 links to a new URL: 785 normal redirects, 1,010 ruled mapped redirects, 511 ghost mapped redirects. Resolved 859 soft-404s. Removed 5 {{dead link}}. Added 336 {{dead link}}. Switched 57 |url-status=dead to live. Switched 375 |url-status=live to dead. Added 4,211 archive URLs (2,733 Wayback).
- This site was challenging but in the process I developed a new technique to discover soft-404s which will be applicable to future sites also
IABot DB
- Checked 17,487 unique URLs and updated 11,718 to propagate through 300+ wikis
Done -- GreenC 00:37, 7 July 2025 (UTC)
Further work
- The /iaq section moved to /indoor-air-quality-iaq, and https://www.epa.gov/indoor-air-quality-iaq has links suggesting that many individual pages still exist but under different names (with no universal regex mapping). 81 pages for manual attention. DMacks (talk) 05:36, 6 July 2025 (UTC)
- Yes this is a large complex site with page moves on top of page moves, since the 1990s, during political and technology changes. If anyone wants to go through the dead links looking for mapping rules to live pages, let me know, I'll program the rules. I found some obvious ones already but there are likely many more. -- GreenC 17:53, 6 July 2025 (UTC)
- surf-your-watershed (282 pages) moved to https://www.epa.gov/waterdata/hows-my-waterway -- GreenC 17:58, 6 July 2025 (UTC)
- Done. -- GreenC 00:37, 7 July 2025 (UTC)
hud.gov
Department of Housing and Urban Development
319 pages -- GreenC 14:50, 1 July 2025 (UTC)
Enwiki
- Checked 319 pages and edited 179 pages. Moved 83 links to a new URL: 5 normal redirects, 63 ruled mapped redirects, 15 ghost mapped redirects. Resolved 50 soft-404s. Added 2 {{dead link}}. Switched 16 |url-status=dead to live. Switched 8 |url-status=live to dead. Added 178 archive URLs (163 Wayback).
IABot DB
- Checked about 628 and updated about 500 to propagate through 300+ wikis
Done -- GreenC 05:53, 7 July 2025 (UTC)
globalchange.gov
US climate reporting site. Removed by Trump admin 1 July 2025.
105 pages -- GreenC 04:45, 3 July 2025 (UTC)
Enwiki
- Checked 107 pages and edited 83 pages. Switched 33 |url-status=live to dead. Added 142 archive URLs (118 Wayback). Changed 10 citation metadata.
IABot DB
- Checked 170 unique URLs and updated 170 to propagate through 300+ wikis
Done
slidewiki.org
SlideWiki's website has been usurped by a gambling/scam site. Likely due to a domain expiration being taken advantage of, but the legitimacy of that speculation is currently unknown. DrCzyżew (talk) 23:48, 6 July 2025 (UTC)
Done per WP:JUDI batch #28. GreenC 21:53, 5 September 2025 (UTC)
Five USA set 2
colorado.gov
- Colorado: 1,557 pages
- Enwiki
- Checked 1,630 pages and edited 870 pages. Moved 555 links to a new URL: 188 normal redirects, 178 ruled mapped redirects, 189 ghost mapped redirects. Resolved 1,055 soft-404s. Added 24 {{dead link}}. Switched 27 |url-status=dead to live. Switched 60 |url-status=live to dead. Added 759 archive URLs (588 Wayback).
- IABot DB
- Checked 1,466 unique links and updated 664 which propagate to 300+ wikis
Done
ct.gov
- Connecticut: 2,831 pages
- Enwiki
- Checked 2,851 pages and edited 1,670 pages. Moved 1,223 links to a new URL: 418 normal redirects, 611 ruled mapped redirects, 194 ghost mapped redirects. Resolved 1,797 soft-404s. Added 28 {{dead link}}. Switched 21 |url-status=dead to live. Switched 68 |url-status=live to dead. Added 1,521 archive URLs (1,433 Wayback).
- This site sometimes has different web servers for http vs https, with different header content, confusing me and the bot. Lots of soft-404s typical of .gov
- IABot DB
- Checked 3,648 unique links and updated 1,531
Done -- GreenC 00:14, 12 July 2025 (UTC)
delaware.gov
- Delaware: 1,249 pages
- Enwiki
- Checked 1,275 pages and edited 721 pages. Moved 537 links to a new URL: 64 normal redirects, 456 ruled mapped redirects, 17 ghost mapped redirects. Resolved 127 soft-404s. Added 12 {{dead link}}. Switched 69 |url-status=dead to live. Switched 18 |url-status=live to dead. Added 748 archive URLs (609 Wayback).
- IABot DB
- Checked 1,767 unique links and updated 647
Done
fl.gov
- Florida: 368 pages
- Enwiki
- Checked 369 pages and edited 36 pages. Moved 16 links to a new URL: 12 normal redirects, 3 ruled mapped redirects, 1 ghost mapped redirects. Resolved 25 soft-404s. Switched 4 |url-status=dead to live. Added 30 archive URLs (30 Wayback).
- IABot DB
- Checked 92 links and updated 43
Done -- GreenC 19:49, 12 July 2025 (UTC)
myflorida.com
- Florida: 681 pages
- Enwiki
- Checked 681 pages and edited 375 pages. Moved 432 links to a new URL: 241 normal redirects, 178 ruled mapped redirects, 13 ghost mapped redirects. Resolved 64 soft-404s. Added 1 {{dead link}}. Switched 9 |url-status=dead to live. Switched 5 |url-status=live to dead. Added 66 archive URLs (60 Wayback).
- IABot DB
- Checked 1,220 unique links and updated 147 to propagate through 300+ wikis
Done
georgia.gov
- Georgia: 651 pages
- Enwiki
- Checked 652 pages and edited 359 pages. Moved 47 links to a new URL: 5 normal redirects, 14 ruled mapped redirects, 28 ghost mapped redirects. Resolved 17 soft-404s. Added 3 {{dead link}}. Switched 2 |url-status=dead to live. Switched 40 |url-status=live to dead. Added 376 archive URLs (362 Wayback).
- IABot DB
- Checked 1,118 links and updated 973
Done - GreenC 19:40, 12 July 2025 (UTC)
fema.gov
Federal Emergency Management Agency -- GreenC 19:53, 12 July 2025 (UTC)
Enwiki
- Checked 840 pages and edited 434 pages. Moved 211 links to a new URL: 77 normal redirects, 130 ruled mapped redirects, 4 ghost mapped redirects. Resolved 135 soft-404s. Added 32 {{dead link}}. Switched 4 |url-status=dead to live. Switched 59 |url-status=live to dead. Added 386 archive URLs (359 Wayback).
IABot DB
- Checked 1,592 and updated 1,217
Done -- GreenC 03:38, 13 July 2025 (UTC)
nih.gov
National Institutes of Health -- GreenC 19:55, 12 July 2025 (UTC)
Site has bot blocking technology that I successfully breached, with future application for other sites (building up a library of methods to circumvent bot blockers). Also testing a new AI-driven method for soft-404 detection to reduce manual checking. And a new AI method for finding ruled mapped redirects. -- GreenC 19:21, 14 July 2025 (UTC)
Enwiki
- Pass 1 (00001-05000): Checked 5,000 pages and edited 2,303 pages. Moved 2,332 links to a new URL: 647 normal redirects, 1,612 ruled mapped redirects, 73 ghost mapped redirects. Resolved 300 soft-404s. Removed 1 {{dead link}}. Added 126 {{dead link}}. Switched 26 |url-status=dead to live. Switched 86 |url-status=live to dead. Added 605 archive URLs (547 Wayback).
- Pass 2 (05001-10000): Checked 5,000 pages and edited 2,184 pages. Moved 2,232 links to a new URL: 422 normal redirects, 1,742 ruled mapped redirects, 68 ghost mapped redirects. Resolved 297 soft-404s. Removed 1 {{dead link}}. Added 85 {{dead link}}. Switched 21 |url-status=dead to live. Switched 71 |url-status=live to dead. Added 514 archive URLs (476 Wayback).
- Pass 3 (10001-20000): Checked 10,000 pages and edited 4,510 pages. Moved 4,469 links to a new URL: 802 normal redirects, 3,550 ruled mapped redirects, 117 ghost mapped redirects. Resolved 726 soft-404s. Removed 4 {{dead link}}. Added 131 {{dead link}}. Switched 50 |url-status=dead to live. Switched 156 |url-status=live to dead. Added 1,175 archive URLs (1,065 Wayback).
- Pass 4 (20001-33500): Checked 13,500 pages and edited 6,104 pages. Moved 6,438 links to a new URL: 1,343 normal redirects, 4,952 ruled mapped redirects, 143 ghost mapped redirects. Resolved 912 soft-404s. Removed 1 {{dead link}}. Added 164 {{dead link}}. Switched 80 |url-status=dead to live. Switched 211 |url-status=live to dead. Added 1,505 archive URLs (1,374 Wayback).
- Pass 5 (33500-47048): Checked 13,565 pages and edited 5,971 pages. Moved 5,862 links to a new URL: 1,082 normal redirects, 4,635 ruled mapped redirects, 145 ghost mapped redirects. Resolved 937 soft-404s. Removed 4 {{dead link}}. Added 175 {{dead link}}. Switched 56 |url-status=dead to live. Switched 240 |url-status=live to dead. Added 1,573 archive URLs (1,463 Wayback).
IABot DB
- Checked over 500,000 unique URLs and updated about 32,000
Done -- GreenC 04:50, 31 July 2025 (UTC)
gsa.gov
General Services Administration -- GreenC 19:57, 12 July 2025 (UTC)
Enwiki
- Checked 756 pages and edited 508 pages. Moved 140 links to a new URL: 75 normal redirects, 16 ruled mapped redirects, 49 ghost mapped redirects. Resolved 60 soft-404s. Added 27 {{dead link}}. Switched 5 |url-status=dead to live. Switched 46 |url-status=live to dead. Added 450 archive URLs (433 Wayback).
IABot DB
- Checked 1,105 unique URLs and updated 852
Done -- GreenC 17:58, 13 July 2025 (UTC)
cyclingarchives.com
This site has been usurped by a gambling site. Therefore, I request for this to be added to JUDI. ~4200 articles. Thank you! MrLinkinPark333 (talk) 00:51, 14 July 2025 (UTC)
Done per WP:JUDI batch #28. GreenC 21:53, 5 September 2025 (UTC)
newsarama.com
This site got merged to Gamesradar in 2020. I didn't find a replacement link for Superman Unchained at the new website. Therefore, I request archives links. If ghost redirects are found, then that is a bonus. ~3000 articles. Thanks again! MrLinkinPark333 (talk) 00:56, 14 July 2025 (UTC)
Enwiki
- Checked 2,992 pages and edited 2,113 pages. Added 315 {{dead link}}. Switched 961 |url-status=live to dead. Added 2,661 archive URLs (2,425 Wayback).
IABot DB
- Checked and updated 5,633 unique URLs
Done -- GreenC 20:26, 31 July 2025 (UTC)
www.dplh.wa.gov.au
Please have all links on this subdomain marked as dead, as they all redirect to the new homepage of this department (example). Only 152 links, many of them not in the main namespace ... and some of these pages need manual updates, but archiving the links would be a lot better than nothing. Thanks! Graham87 (talk) 04:33, 14 July 2025 (UTC)
Enwiki
- Checked 122 pages and edited 113 pages. Switched 11 |url-status=live to dead. Added 121 archive URLs (121 Wayback).
IABot DB
- Checked and updated 57 unique URLs
Done -- GreenC 21:27, 31 July 2025 (UTC)
ctvnews.ca
It appears ctvnews.ca changed the format of their urls at some point. They were of the format area.ctvnews.ca and are now ctvnews.ca/area/. A couple of examples, in August 8 the url "https://toronto.ctvnews.ca/former-premier-of-ontario-william-davis-dead-at-92-1.5539011" will auto redirect you to "https://www.ctvnews.ca/toronto/article/former-premier-of-ontario-william-davis-dead-at-92/".
However in May 1 the url "https://atlantic.ctvnews.ca/it-s-got-to-make-some-kind-of-change-boycott-of-loblaws-owned-stores-begins-1.6869575" results in a 404, even though the article still exists at "https://www.ctvnews.ca/atlantic/article/its-got-to-make-some-kind-of-change-boycott-of-loblaws-owned-stores-begins/". The redirect fails because it tries to redirect to "/it-s-" while the article is at "/its-".
I'm not sure how many areas there are but they include; Toronto, Atlantic, Montreal, Edmonton, Regina, and bc. There could be others. -- LCU ActivelyDisinterested «@» °∆t° 11:56, 17 July 2025 (UTC)
- There was another domain Wikipedia:Link_rot/URL_change_requests#gamasutra.com with the same apostrophe problem (it-s vs its). The techniques learned there will be applicable here. -- GreenC 15:21, 17 July 2025 (UTC)
- Domain required a lot of special rules, it is now able to convert about 70% to live links. The other 30% are dead links mostly, but also some edge cases the rules can't catch, maybe 10% of that 30%. -- GreenC 21:05, 26 July 2025 (UTC)
- Amazing work, thanks GreenC. -- LCU ActivelyDisinterested «@» °∆t° 21:15, 26 July 2025 (UTC)
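The pattern reported at the top of this thread (area subdomain moved into the path, trailing numeric id dropped, "it-s" collapsed to "its") can be written as a single mapping rule. The sketch below is only an illustration built from the two example URLs quoted above; it is not one of the bot's actual rules, the name `map_ctvnews_url` is hypothetical, and the apostrophe handling is a heuristic that could misfire on slugs where "-s-" is a genuine word boundary.

```python
import re
from typing import Optional
from urllib.parse import urlsplit

# Trailing article id such as "-1.5539011" (assumed shape, from the examples above).
_ARTICLE_ID = re.compile(r"-\d+\.\d+$")
# "-s" left behind by a stripped apostrophe, e.g. "it-s-got" (heuristic).
_APOSTROPHE_S = re.compile(r"(?<=[a-z])-s(?=-|$)")


def map_ctvnews_url(old_url: str) -> Optional[str]:
    """Map an old area.ctvnews.ca article URL to the new www.ctvnews.ca form.

    Returns None when the URL does not match the old pattern. The result is
    a candidate only and should still be checked for a live response.
    """
    parts = urlsplit(old_url)
    host = (parts.hostname or "").lower()
    if not host.endswith(".ctvnews.ca"):
        return None
    area = host[: -len(".ctvnews.ca")]
    if area in ("", "www") or "." in area:
        return None
    slug = parts.path.strip("/")
    if not slug or "/" in slug or not _ARTICLE_ID.search(slug):
        return None
    slug = _ARTICLE_ID.sub("", slug)
    slug = _APOSTROPHE_S.sub("s", slug)
    return f"https://www.ctvnews.ca/{area}/article/{slug}/"


if __name__ == "__main__":
    print(map_ctvnews_url(
        "https://atlantic.ctvnews.ca/it-s-got-to-make-some-kind-of-change-"
        "boycott-of-loblaws-owned-stores-begins-1.6869575"
    ))
```

A rule like this only proposes a target URL; as noted above, a share of links stay dead or fall into edge cases the rules cannot catch.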
Enwiki
- Batch 1 (00000-05000): Checked 5,000 pages and edited 4,639 pages. Moved 5,510 links to a new URL: 423 normal redirects, 4,979 ruled mapped redirects, 108 ghost mapped redirects. Resolved 123 soft-404s. Removed 4 {{dead link}}. Added 39 {{dead link}}. Switched 122 |url-status=dead to live. Switched 642 |url-status=live to dead. Added 2,080 archive URLs (1,976 Wayback).
- Batch 2 (05001-10214): Checked 5,214 pages and edited 4,835 pages. Moved 5,546 links to a new URL: 396 normal redirects, 5,065 ruled mapped redirects, 85 ghost mapped redirects. Resolved 133 soft-404s. Removed 5 {{dead link}}. Added 48 {{dead link}}. Switched 122 |url-status=dead to live. Switched 770 |url-status=live to dead. Added 2,012 archive URLs (1,941 Wayback).
IABot DB
- Checked 25,039 and updated 7,563 unique URLs
Done -- GreenC 03:25, 3 August 2025 (UTC)
economist.com/node/
These links redirect to new URLs like this to that for BASIC countries. I tested a few with question marks at the end and they seem to be redirecting. 2,951 articles. Thanks! MrLinkinPark333 (talk) 01:39, 19 July 2025 (UTC)
Enwiki
- Checked 2,953 pages and edited 2,907 pages. Moved 3,310 links to a new URL: 377 normal redirects, 2,928 ruled mapped redirects, 5 ghost mapped redirects. Resolved 11 soft-404s. Added 4 {{dead link}}. Switched 81 |url-status=dead to live. Switched 10 |url-status=live to dead. Added 17 archive URLs (15 Wayback).
- found another 200 fixed -- GreenC 03:21, 3 August 2025 (UTC)
Done -- GreenC 02:17, 2 August 2025 (UTC)
WikiWiX.com
WikiWiX was an archiving platform which has gone defunct. The domain registration expired at the end of April this year. It was used on 4,291 articles on enwiki (not as bad as the 39,851 on frwiki). Can anything be done to switch to an alternative archive? In at least one case I've seen WikiWiX used to archive a web.archive.org link. Cabayi (talk) 14:46, 21 July 2025 (UTC)
- hat tip to ticket:2025072010002392. Cabayi (talk) 14:48, 21 July 2025 (UTC)
I'm glad you brought this up. On French wiki it is actually ingrained through a custom patch to MediaWiki that adds an "[archive]" link automatically next to every external URL (eg fr:Aubelin_Jolicoeur in the Sources section). It started as a small non-profit that had external funding. Recently they lost their sponsor and the owner took it over as a personal project. He then got hit badly by DoS (or AI scrapers) and has had trouble keeping it up. The main tech guy left, making it a one-man operation. The site reliability is terrible, going up and down. Attempts by the French community to escape this trap have been unsuccessful because the owner has a bunch of supporters typical of small wiki politics. Nobody seems willing or able to get rid of WikiWix, so long as the owner keeps making promises and telling positive stories.
On English Wikipedia, I made previous attempts to convert WikiWix links to Wayback Machine. There are probably new ones added since then. The links also exist in the IABot database - I also unwound most of those where possible but they still exist propagating through 300+ wikis via IABot. I recall it was a difficult operation, but I also wrote a lot of code for it, so maybe that code can still be applied.
It's unclear the site is actually defunct. Until the owner literally says so, the site has a tendency to keep popping back up in hobbled form. I would like nothing better than definitive proof of defunct. -- GreenC 16:14, 21 July 2025 (UTC)
- FWIW the registrar still propagates, the domain is in the root servers: dig +trace wikiwix.com. Not sure why who.is information says expired. The ICANN tool can't connect to the registrar, and the registrar's whois.ovh.com doesn't work. Odd setup. -- GreenC 18:00, 28 July 2025 (UTC)
680news.com
Most of these links that I tested redirect to links at citynews.ca. However, the redirect at De Havilland Canada DHC-3 Otter doesn't work and has no new url. I request that the redirects be done first then archives for the ones that don't. ~190 Thank you! MrLinkinPark333 (talk) 05:46, 27 July 2025 (UTC)
Enwiki
- Checked 194 pages and edited 146 pages. Moved 135 links to a new URL: 47 normal redirects, 86 ruled mapped redirects, 2 ghost mapped redirects. Resolved 4 soft-404s. Added 5 {{dead link}}. Switched 1 |url-status=live to dead. Added 12 archive URLs (11 Wayback).
Done -- GreenC 04:02, 3 August 2025 (UTC)
statto.com
- Moved from User:GreenC talk page
Hi, Many football-related Wikipedia pages currently link to archived versions of my website (https://www.statto.com/), which was offline for a while. It’s now fully restored, and the original content is live again. Since I can’t use a bot or edit all the pages manually, it was suggested to me that you can help update those Webarchive links back to the original URLs. Please let me know what’s possible or how best to proceed. Ggvanncaa (talk) 07:59, 30 July 2025 (UTC)
- Ggvanncaa: I should be able to. You are in the queue. 2,641 pages -- GreenC 15:22, 30 July 2025 (UTC)
Enwiki
- Checked 2,641 pages and edited 1,745 pages. Made live 2,784 URLs. Removed 45 {{dead link}}. Added 1 archive URLs (0 Wayback).
Done -- GreenC 04:55, 3 August 2025 (UTC)
anandtech.com
The article archives for the AnandTech website have been removed, and links to AnandTech or its article archives are now being redirected to the AnandTech forums front page. Links to AnandTech articles on Wikipedia need to be modified to go to an archive site. Jesse Viviano (talk) 06:05, 5 August 2025 (UTC)
Enwiki
- Checked 1,191 pages and edited 1,171 pages. Added 14 {{dead link}}. Switched 707 |url-status=live to dead. Added 2,395 archive URLs (2,049 Wayback).
IABot DB
- Checked and updated 2,551 unique URLs
Done -- GreenC 20:47, 6 August 2025 (UTC)
belediyye.org
This now redirects to a website for a New Jersey state representative, for doubtlessly inscrutable reasons, so link templates should be updated accordingly. --Slowking Man (talk) 16:03, 5 August 2025 (UTC)
Not done only because there was nothing to do. It was already done. Of the 1,461 citations, 1,459 have the same URL. -- GreenC 23:53, 6 August 2025 (UTC)
musicline.de
Musicline.de used to host the Germany music charts. Although these chart positions can be found at offiziellecharts.de, they can't be converted over because they have a numerical id in the URL. Some pages already have archives. Not sure if The Fixer (song) can be fixed because it's in a wikitable. ~1400. Thanks! MrLinkinPark333 (talk) 02:32, 8 August 2025 (UTC)
Enwiki
- Checked 1,468 pages and edited 570 pages. Added 60 {{dead link}}. Switched 37 |url-status=live to dead. Added 591 archive URLs (530 Wayback).
IABot DB
- Checked and updated about 12,000 unique URLs
Done -- GreenC 05:09, 18 August 2025 (UTC)
abs-cbn.com
This website has a mixture of things that need fixing.
- 1) Urls with domain names, like entertainment.abs-cbn.com, are redirecting to soft 404s.
- 2) Links that start with https://abs-cbn.com or http://abs-cbn.com/ are broken. No luck with adding www in front of the URL.
- 3) Links that start with http://www.abs-cbn.com are redirecting to new live links at https://www.abs-cbn.com.
~2000 articles. I filtered out the Https www links as they seem to be working. Thanks! MrLinkinPark333 (talk) 01:25, 9 August 2025 (UTC)
- I found 7,700 without https://www .. but the entire domain is not much more (9,334), so I might as well do them all and see what turns up. -- GreenC 01:02, 20 August 2025 (UTC)
- I see you found a lot more than I thought. If you feel like doing them all, go for it! MrLinkinPark333 (talk) 01:07, 20 August 2025 (UTC)
Enwiki
- Batch 1: Checked 4,000 pages and edited 3,428 pages. Moved 9,023 links to a new URL: 7,463 normal redirects, 1,523 ruled mapped redirects, 37 ghost mapped redirects. Resolved 2,187 soft-404s. Removed 25 {{dead link}}. Added 209 {{dead link}}. Switched 456 |url-status=dead to live. Switched 223 |url-status=live to dead. Added 2,000 archive URLs (1,912 Wayback).
- Batch 2: Checked 5,748 pages and edited 4,997 pages. Moved 13,138 links to a new URL: 10,913 normal redirects, 2,137 ruled mapped redirects, 88 ghost mapped redirects. Resolved 3,326 soft-404s. Removed 6 {{dead link}}. Added 235 {{dead link}}. Switched 474 |url-status=dead to live. Switched 421 |url-status=live to dead. Added 2,530 archive URLs (2,476 Wayback).
IABot DB
- Checked 15,000 URLs and updated about 2,867
Notes
- The soft-404s in this case were difficult because while the page content was displaying a "home" page, the URL itself didn't redirect, so it required foreknowledge of HTML keywords to know when the page landed on a soft-404. There were about 3 dozen different "home" page variations to be discovered. For this reason I had to redo Batch 1 a couple times before it was ready for upload.
Done -- GreenC 21:04, 24 August 2025 (UTC)
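The keyword check described in the notes above could look roughly like the following. This is a minimal sketch, not the bot's actual code: the marker strings and the function name are hypothetical, since the thread does not list the real HTML keywords or the roughly three dozen "home" page variations.

```python
from typing import Iterable

# Hypothetical markers for illustration only; the real keyword list
# used for abs-cbn.com is not given in the discussion.
HOME_PAGE_MARKERS = (
    "<title>abs-cbn | home</title>",
    'id="homepage-hero"',
)


def looks_like_soft_404(html: str, markers: Iterable[str] = HOME_PAGE_MARKERS) -> bool:
    """Return True if the fetched body contains any known home-page marker.

    The URL itself does not redirect in this case, so only the page
    content can reveal that a "home" page was served instead of the article.
    """
    lowered = html.lower()
    return any(marker.lower() in lowered for marker in markers)
```

Each newly discovered home-page variation would be added to the marker list, which is consistent with having to rerun a batch after the list grows.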
1up.com
The article archives for the 1UP.com website as seen in 1Up Network have been removed, and links to 1UP.com or its article archives are now being redirected to the IGN front page. Links to 1UP.com articles on Wikipedia need to be modified to go to an archive site. Jesse Viviano (talk) 09:29, 10 August 2025 (UTC)
Enwiki
- Checked 3,254 pages and edited 618 pages. Added 20 {{dead link}}. Switched 440 |url-status=live to dead. Added 332 archive URLs (263 Wayback).
IABot DB
- Checked and updated 6,761 URLs
Done -- GreenC 21:37, 25 August 2025 (UTC)
collider.com
Urls ending with /number/ can be fixed by removing the number and ending slash of the URL. This is now here for Anne Hathaway. I do not know how to extract these URLs as the number IDs are at the end of the URL. I've only found that one and this for Bruce Willis so far from the 10k overall links. May only be a handful to convert. In any case, thank you! MrLinkinPark333 (talk) 03:25, 13 August 2025 (UTC)
- Are you doing the entire domain? I only requested the ones with numbers at the end. The rest can be filtered out. MrLinkinPark333 (talk) 22:04, 25 August 2025 (UTC)
- It's OK, I'm here, might as well check them. It didn't take long because most return 200 or simply convert to https. As for filtering, I don't know how with insource, need to load all links anyway. Also, it's usually best to check the entire domain because it creates more data for soft-404 algorithms to learn from, and there are usually unknown-unknowns to be discovered. In this case I discovered this is now here, added the rule and repaired a couple hundred. I won't do IABot because there isn't much that can be done, IABot doesn't support URL moves which is mostly what this is (other than 52 archives in 10,000+ URLs). -- GreenC 02:51, 26 August 2025 (UTC)
- I didn't want you to waste your time. However, I see you found more to fix. Thank you! MrLinkinPark333 (talk) 03:35, 26 August 2025 (UTC)
Enwiki
- Checked 10,307 pages and edited 2,312 pages. Moved 2,275 links to a new URL: 78 normal redirects, 2,182 ruled mapped redirects, 15 ghost mapped redirects. Resolved 21 soft-404s. Removed 1 {{dead link}}. Added 11 {{dead link}}. Switched 470 |url-status=dead to live. Switched 51 |url-status=live to dead. Added 52 archive URLs (49 Wayback).
Done -- GreenC 02:51, 26 August 2025 (UTC)
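The originally requested fix (dropping a numeric ID and its ending slash from the end of a collider.com URL) can be sketched as below. This is an illustration under assumptions, not the rule the bot used: the function name and the example URL are hypothetical, and the sketch assumes the slash before the number is kept.

```python
import re

# Matches a purely numeric final path segment plus its ending slash,
# e.g. the "123456/" in ".../example-article/123456/".
_TRAILING_ID = re.compile(r"(?<=/)\d+/$")


def strip_trailing_id(url: str) -> str:
    """Remove a trailing numeric segment and its slash; leave other URLs alone."""
    return _TRAILING_ID.sub("", url)


# Hypothetical example URL, for illustration only.
print(strip_trailing_id("https://collider.com/example-article/123456/"))
```

The lookbehind keeps slugs that merely end in digits (such as a hypothetical "top-10/") untouched, because the digits must form the whole final segment.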
casetext.com
Site recently shut down by parent company. ~2300 pages. James (talk/contribs) 05:34, 14 August 2025 (UTC)
Enwiki
- Checked 2,266 pages and edited 2,144 pages. Added 330 {{dead link}}. Switched 145 |url-status=live to dead. Added 2,391 archive URLs (2,382 Wayback).
IABot DB
- Checked and updated 2,221
Done -- GreenC 14:28, 27 August 2025 (UTC)
sa-ema.com
This website used to host South African music charts. Now, it's been taken over by a gambling site. Some of the articles already have archived links. I would like to request this site to be added to JUDI. Of the 48 pages, there's a few lists with many URLs to this site. Thanks! MrLinkinPark333 (talk) 20:51, 16 August 2025 (UTC)
Done per WP:JUDI batch #28. GreenC 21:53, 5 September 2025 (UTC)
nzhistory.net.nz
Following a URL move from nzhistory.net.nz → nzhistory.govt.nz, the former has now been taken over by a gambling website. Could a bot change ".net" to ".govt" in all the outdated refs? Currently looks like ~1,160 pages. Thanks, Nil🥝 03:49, 18 August 2025 (UTC)
Enwiki
- Checked 1,164 pages and edited 1,146 pages. Moved 1,499 links to a new URL: 1,494 ruled mapped redirects, 5 ghost mapped redirects. Resolved 1 soft-404s. Added 3 {{dead link}}. Switched 70 |url-status=dead to live. Switched 5 |url-status=live to dead. Added 41 archive URLs (38 Wayback).
Done -- GreenC 06:51, 28 August 2025 (UTC)
- Much appreciated, thank you! Nil🥝 01:54, 29 August 2025 (UTC)
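For illustration, the requested ".net" to ".govt" change amounts to a host rewrite like the one below. This is only a sketch of the idea, not the bot's mapping rule: the function name and the example path are hypothetical, and it leaves the scheme, path, and query untouched.

```python
from urllib.parse import urlsplit, urlunsplit

OLD_HOST = "nzhistory.net.nz"
NEW_HOST = "nzhistory.govt.nz"


def move_to_govt(url: str) -> str:
    """Rewrite nzhistory.net.nz (and its subdomains) to nzhistory.govt.nz."""
    parts = urlsplit(url)
    netloc = parts.netloc.lower()
    if netloc != OLD_HOST and not netloc.endswith("." + OLD_HOST):
        return url  # not the usurped domain, leave unchanged
    new_netloc = netloc[: -len(OLD_HOST)] + NEW_HOST
    return urlunsplit(parts._replace(netloc=new_netloc))


# Hypothetical example path, for illustration only.
print(move_to_govt("http://www.nzhistory.net.nz/example/page"))
```

Parsing the URL rather than doing a plain string replace avoids touching ".net" elsewhere in the path or query.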
mht.maryland.gov
this moved here. -- GreenC 03:59, 18 August 2025 (UTC)
Enwiki
- Checked 1,434 pages and edited 1,428 pages. Moved 1,529 links to a new URL: 3 normal redirects, 1,525 ruled mapped redirects, 1 ghost mapped redirects. Added 14 {{dead link}}. Switched 8 |url-status=dead to live. Switched 53 |url-status=live to dead. Added 209 archive URLs (201 Wayback).
IABot DB
- Checked 1,400
Done -- GreenC 05:30, 30 August 2025 (UTC)
China Post
Chinapost.com.tw was the URL before the publication got bought by Now News and moved to Chinapost.nownews.com. The newspaper no longer exists, making both the old and new urls broken.
Thanks! MrLinkinPark333 (talk) 18:44, 18 August 2025 (UTC)
chinapost.com.tw
- Enwiki
- Checked 1,207 pages and edited 926 pages. Added 16 {{dead link}}. Switched 256 |url-status=live to dead. Added 1,125 archive URLs (1,051 Wayback).
- IABot DB
- Checked 1,180 urls
chinapost.nownews.com
- Enwiki
- Checked 97 pages and edited 77 pages. Added 7 {{dead link}}. Switched 13 |url-status=live to dead. Added 66 archive URLs (66 Wayback).
- IABot DB
- Checked 114 urls
Done -- GreenC 21:12, 30 August 2025 (UTC)
hyperfun
There are two urls because this is the second time the project site has moved:
- 1) cis.k.hosei.ac.jp/~F-rep/ (dead link)
- 2) hyperfun.org (spammer: link to casino on homepage, no impersonation)
The second link appears to have been usurped a year or two ago.
I've manually changed a few links (regular and archive) on the pages HyperFun and Function Representation to a site which matches the archive.
Bhbuehler (talk) 20:23, 20 August 2025 (UTC)
Not done - only in a couple pages, and they are already fixed. -- GreenC 21:17, 30 August 2025 (UTC)
a-o-f.org
Seems to have been usurped by gambling™ since 2008 or so, but it's used crosswiki. Perryprog (talk) 02:11, 21 August 2025 (UTC)
Done per WP:JUDI batch #28. GreenC 21:53, 5 September 2025 (UTC)
idolator.com
Idolator was part of SpinMedia before it got sold in 2016. The website was last updated in 2022. If any of these links on Wikipedia are live, I request archive copies of them in case the website fully shuts down. Otherwise, archived copies of broken links are appreciated. ~3700 articles. Thanks! MrLinkinPark333 (talk) 02:55, 21 August 2025 (UTC)
- WaybackMachine has no archives available: example. The site requested not to be archived. That leaves archive.today, or any that are still live. The rest will be {{dead link}}. -- GreenC 00:55, 31 August 2025 (UTC)
Enwiki
- Checked 3,785 pages and edited 3,225 pages. Moved 10 links to a new URL: 1 normal redirects, 9 ruled mapped redirects. Resolved 28 soft-404s. Added 2,202 {{dead link}}. Switched 1,171 |url-status=live to dead. Added 1,546 archive URLs (0 Wayback).
IABot DB
- Checked 7,782 URLs
Done -- GreenC 16:58, 31 August 2025 (UTC)
- I'm not sure why anything would specifically ask to not have its pages be archived, but for the record, this January 2025 piece is the most recent thing I could find from them after a few 2024 articles. It's unclear what happened with activity there when there's nothing to be found for 2023. SNUGGUMS (talk / edits) 20:38, 31 August 2025 (UTC)
- I checked each link if it is live, before adding an archive URL or {{dead link}}. Probably 5% to 10% of domains request to be removed. The problem is when a company goes defunct, they never lift the block, it has to stay until someone requests to remove it, which never happens, effectively banishing them forever. Except from sites like archive.today which generally don't do archive blocks, but for that reason makes them more vulnerable to legal action and potential takedown. -- GreenC 21:42, 31 August 2025 (UTC)
- The idea of getting sued never crossed my mind. If archive.today and other things used to preserve Idolator's entries are all taken down for any reason, then that might render some or all pieces from the site unusable. I hope it doesn't come down to that. SNUGGUMS (talk / edits) 21:57, 31 August 2025 (UTC)
mediabistro.com
Found in a GAN spotcheck and redirects to this, which sure as hell looks like a usurpation from what I've seen. See also Wikipedia's redirect MediaBistro, which says the parent company liquidated in 2015. Departure– (talk) 03:35, 21 August 2025 (UTC)
- Okay, apparently the redirect I linked was to a different site, and the one I found before writing this was Mecklermedia, the site's pre-2015 parent company. Departure– (talk) 03:36, 21 August 2025 (UTC)
- Departure–, if I understand correctly, there is nothing to do..? -- GreenC 21:50, 30 August 2025 (UTC)
- The thing is that links that once led to content that at one point verified information now lead to a site that doesn't. I apologize if this is the wrong venue to bring this, but I was hoping this was the place to report link changes of this type. Departure– (talk) 22:03, 30 August 2025 (UTC)
- You are in the right place. I see now what is happening. I'll process the domain. If any are legit live, it will keep them live otherwise the rest look like soft-404s ie. they look like working pages, but redirect to a home page. Those will be converted to archive URLs. -- GreenC 22:14, 30 August 2025 (UTC)
Departure–, the data was showing a connection to AdWeek. I asked Google "has mediabistro.com been bought by adweek.com" it replied:
Acquisition Date: Prometheus Global Media acquired Mediabistro in May 2014 for $8 million. Acquiring Company: The parent company of Adweek, The Hollywood Reporter, and Billboard, Prometheus Global Media, bought Mediabistro. Integration: The acquired Mediabistro sites, including blogs and job boards, were merged into the Adweek Blog Network, with their web addresses now starting with Adweek.com. Result: The Mediabistro brand was absorbed by its parent company, and its content became part of Adweek's offerings
I was able to confirm, for example this moved here (via Adam Housley). A lot of the old mediabistro links are salvageable with the same move. Even top-level links like https://www.mediabistro.com/tvnewser/ are now at https://www.adweek.com/tvnewser/ .. however there are old-style links that I don't know how to save: http://www.mediabistro.com/articles/cache/a1303.asp or http://www.mediabistro.com/articles/details.asp?aID=11835 -- GreenC 17:30, 31 August 2025 (UTC)
Enwiki
- Checked 1,217 pages and edited 959 pages. Moved 876 links to a new URL: 1 normal redirects, 873 ruled mapped redirects, 2 ghost mapped redirects. Resolved 659 soft-404s. Removed 1 {{dead link}}. Added 5 {{dead link}}. Switched 230 |url-status=dead to live. Switched 19 |url-status=live to dead. Added 281 archive URLs (250 Wayback).
IABot DB
- Checked 1,933 URLs
Done -- GreenC 22:05, 31 August 2025 (UTC)
- Thank you @Departure– for bringing this up and @GreenC for tackling this. Mediabistro was indeed absorbed. It ran the blogs TVNewser (covering US national TV news) and TVSpy (covering US local TV news), so I'm not surprised that there was a link in Gary England, a local TV meteorologist. Sammi Brie (she/her · t · c) 22:27, 31 August 2025 (UTC)
zambiawatchdog.com
Looks like it went down around April 18th. 70 pages GrapesRock (talk) 17:49, 24 August 2025 (UTC)
Enwiki
- Checked 70 pages and edited 64 pages. Added 3 {{dead link}}. Switched 2 |url-status=live to dead. Added 75 archive URLs (75 Wayback).
IABot DB
- Checked 75 URLs
Done -- GreenC 01:08, 1 September 2025 (UTC)
groklaw.net
groklaw.net has been usurped, some content has been removed, other content has apparently had crypto spam added. https://en.wikipedia.org/wiki/Groklaw#Later_history "As of 18 August 2025 the site points to a crypto gambling site."
Much content is available on archive.org DimeCadmium (talk) 14:38, 28 August 2025 (UTC)
Done per WP:JUDI batch #28. GreenC 21:53, 5 September 2025 (UTC)
foxsports.ph
foxsports.ph is no longer active following the shutdown of Fox Sports Asia. MarcusAbacus (talk) 15:48, 29 August 2025 (UTC)
Enwiki
- Checked 158 pages and edited 103 pages. Added 8 {{dead link}}. Switched 3 |url-status=live to dead. Added 114 archive URLs (114 Wayback).
IABot DB
- Checked 213 URLs
Done -- GreenC 05:48, 1 September 2025 (UTC)
tiebreakertimes.com
tiebreakertimes.com is no longer active and was moved to tiebreakertimes.com.ph. It seems that all of the news articles fall under the new URL. MarcusAbacus (talk) 12:00, 30 August 2025 (UTC)
Enwiki
- Checked 68 pages and edited 65 pages. Moved 113 links to a new URL: 1 normal redirects, 112 ruled mapped redirects. Switched 16 |url-status=dead to live. Added 5 archive URLs (5 Wayback).
IABot DB
- Checked 99 URLs
Done -- GreenC 17:30, 1 September 2025 (UTC)
bbc.co.uk
Many of these redirect to new URLs at bbc.com. However, some will stay at the current URL like this one. I cannot predict which ones will redirect. As this is a huge site, I have to break it up into multiple requests.
Once these ones are fixed, I'll request the rest. Thanks! MrLinkinPark333 (talk) 22:40, 21 August 2025 (UTC)
- BBC has a very large number of pages on enwiki, probably one of the largest domains across all Wikipedia language sites. Will finish smaller requests first, WP:JUDI is backlogged, then return to look at BBC. It's a good idea to break it up. -- GreenC 21:57, 30 August 2025 (UTC)
- Agreed. Considering there's ~140k overall. For the ones I didn't include above, there's a mixture of broken and new links. Therefore, it's better to do it in chunks. MrLinkinPark333 (talk) 22:51, 30 August 2025 (UTC)
- Preliminary data suggests this is a well maintained and stable site with few surprises. The number and type of changes are relatively minor, like http/https, or adding a "www", that sort of thing, based on existing redirects. The most exotic change is removing ".amp" from URLs added by mobile users. In 500 pages there were 0 dead links. It should go fast. This is nice, I run it and go do something else while it cranks through 10s of thousands of pages making small changes. Hope they don't block the bot is the concern. -- GreenC 17:12, 3 September 2025 (UTC)
bbc.co.uk/news
- /news/ ~60k
- Enwiki
- Batch 1 (00001-10000): Checked 10,000 pages and edited 1,016 pages. Moved 1,078 links to a new URL: 615 normal redirects, 460 ruled mapped redirects, 3 ghost mapped redirects. Resolved 18 soft-404s. Added 13 {{dead link}}. Switched 17 |url-status=dead to live. Switched 3 |url-status=live to dead. Added 21 archive URLs (14 Wayback).
- Batch 2 (10001-30000): Checked 20,000 pages and edited 2,156 pages. Moved 2,280 links to a new URL: 1,412 normal redirects, 860 ruled mapped redirects, 8 ghost mapped redirects. Resolved 44 soft-404s. Removed 1 {{dead link}}. Added 40 {{dead link}}. Switched 31 |url-status=dead to live. Switched 7 |url-status=live to dead. Added 47 archive URLs (37 Wayback).
- Batch 3 (30001-62256): Checked 32,257 pages and edited 3,357 pages. Moved 3,586 links to a new URL: 2,199 normal redirects, 1,374 ruled mapped redirects, 13 ghost mapped redirects. Resolved 70 soft-404s. Added 117 {{dead link}}. Switched 65 |url-status=dead to live. Switched 9 |url-status=live to dead. Added 70 archive URLs (52 Wayback).
bbc.co.uk/sport
- /sport/ ~34k
- Batch 1 (00001-05000): Checked 5,000 pages and edited 2,363 pages. Moved 16,827 links to a new URL: 8,659 normal redirects, 8,120 ruled mapped redirects, 48 ghost mapped redirects. Resolved 4 soft-404s. Removed 1 {{dead link}}. Added 46 {{dead link}}. Switched 139 |url-status=dead to live. Switched 6 |url-status=live to dead. Added 101 archive URLs (81 Wayback).
- Batch 2 (05001-10000): Checked 5,000 pages and edited 2,398 pages. Moved 17,695 links to a new URL: 8,472 normal redirects, 9,179 ruled mapped redirects, 44 ghost mapped redirects. Resolved 2 soft-404s. Added 54 {{dead link}}. Switched 246 |url-status=dead to live. Switched 11 |url-status=live to dead. Added 114 archive URLs (84 Wayback).
- Batch 3 (10001-34614): Checked 24,616 pages and edited 11,598 pages. Moved 84,277 links to a new URL: 40,253 normal redirects, 43,128 ruled mapped redirects, 896 ghost mapped redirects. Resolved 9 soft-404s. Added 299 {{dead link}}. Switched 850 |url-status=dead to live. Switched 49 |url-status=live to dead. Added 530 archive URLs (413 Wayback).
Done -- GreenC 21:51, 5 September 2025 (UTC)
Australian Dictionary of Biography
The Australian Dictionary of Biography blocks many calls from "http://" so can any strings in articles like "http://adb.anu.edu.au/biography", "http://adbonline.anu.edu.au/biogs" or "http://www.adb.online.anu.edu.au/biogs" be altered to https:// please? (the template {{Cite Australian Dictionary of Biography}} was recently modified to add the "s" to https://) DivermanAU (talk) 02:23, 2 September 2025 (UTC)
- Anyone able to assist here? By not having "https" the links to the old site do not redirect to the new site, so user will see a "Page not found" message. Or, as an editor for over 10 years, can I make the changes myself? (I just need a few instructions). DivermanAU (talk) 19:16, 8 September 2025 (UTC)
- I do these requests chronologically and you are next in line (see the "Done" tag above this request). -- GreenC 19:54, 8 September 2025 (UTC)
- @GreenC — Thanks so much for doing this! It really makes a difference to users reading these articles. DivermanAU (talk) 12:52, 10 September 2025 (UTC)
- You are welcome. -- GreenC 03:00, 12 September 2025 (UTC)
adb.anu.edu.au
- Enwiki
- Checked 7,561 pages and edited 4,600 pages. Moved 5,929 links to a new URL: 115 normal redirects, 5,806 ruled mapped redirects, 8 ghost mapped redirects. Removed 3 {{dead link}}. Added 26 {{dead link}}. Switched 127 |url-status=dead to live. Added 15 archive URLs (14 Wayback).
adb.online.anu.edu.au
- Enwiki
- Checked 1,533 pages and edited 1,502 pages. Moved 2,038 links to a new URL: 32 normal redirects, 2,006 ruled mapped redirects. Removed 1 {{dead link}}. Added 1 {{dead link}}. Switched 16 |url-status=dead to live. Added 1 archive URLs (1 Wayback).
- Thanks again for these fixes, I can see the entire old url has been changed to the new one, which is great. But I can still find 1,519 articles if I search insource:"http://www.adb.online.anu.edu.au/biogs" including January 4, Type 26 frigate and Protectionist Party (the two External links) which result in "page not found" when the ADB link is clicked. DivermanAU (talk) 03:23, 12 September 2025 (UTC)
- I think you are seeing a backend delay. Looking at Type 26 frigate, the URL in the wikitext is correct. -- GreenC 03:55, 12 September 2025 (UTC)
- Yes, looks like you were right - it's just a delay. I can see those articles are fixed now! Thanks again! — DivermanAU (talk) 05:57, 12 September 2025 (UTC)
- Great. I'll make it done again but if you see anything else, let me know. -- GreenC 14:45, 12 September 2025 (UTC)
Done -- GreenC 14:45, 12 September 2025 (UTC)
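The scheme change originally requested for the three ADB hosts can be sketched as follows. This is a minimal illustration, not what the bot did (the stats above show it moved links to the new URL rather than only switching scheme); the function name and example path are hypothetical.

```python
from urllib.parse import urlsplit, urlunsplit

# Hosts named in the request above.
ADB_HOSTS = {
    "adb.anu.edu.au",
    "adbonline.anu.edu.au",
    "www.adb.online.anu.edu.au",
}


def upgrade_to_https(url: str) -> str:
    """Switch http:// to https:// for the listed ADB hosts only."""
    parts = urlsplit(url)
    if parts.scheme == "http" and parts.netloc.lower() in ADB_HOSTS:
        return urlunsplit(parts._replace(scheme="https"))
    return url


# Hypothetical example path, for illustration only.
print(upgrade_to_https("http://adb.anu.edu.au/biography/example-1234"))
```

Restricting the change to an explicit host list keeps unrelated http:// links in the same article untouched.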
billboard.com/articles/columns
These links are redirecting to new URLs with various subdomains. Please note the following:
- This needs the apostrophe removed to make a redirect to the new URL for Justin Timberlake discography.
- Sometimes the number ID will stay. For example, this goes here for United Kingdom.
If any of these don't redirect, please let me know. I can see if the URL needs adjusting in order to make redirects. Thanks! MrLinkinPark333 (talk) 18:59, 2 September 2025 (UTC)
@MrLinkinPark333: I processed 5,000 pages (not uploaded) and the stats are:
- Checked 5,000 pages and edited 4,983 pages. Moved 10,007 links to a new URL: 6,381 normal redirects, 3,623 ruled mapped redirects, 3 ghost mapped redirects. Resolved 29 soft-404s. Removed 1 {{dead link}}. Added 16 {{dead link}}. Switched 291 |url-status=dead to live. Switched 54 |url-status=live to dead. Added 108 archive URLs (108 Wayback).
You asked "If any of these don't redirect, please let me know": 108 archives, 54 live to dead, and 16 dead link templates. -- GreenC 04:32, 10 September 2025 (UTC)
- Do you see a pattern in the ones that didn't redirect? If you could post a few examples, I'll see if they got moved to new URLS. MrLinkinPark333 (talk) 17:16, 10 September 2025 (UTC)
- Wikipedia:Link rot/Cases/Billboard 50 sample wayback links. - GreenC 19:40, 10 September 2025 (UTC)
- I found that adding an ending slash makes this redirect to that for Falling Down (Selena Gomez & the Scene song) and Hit the Lights (Selena Gomez & the Scene song). This doesn't work for others. MrLinkinPark333 (talk) 20:05, 10 September 2025 (UTC)
- OK great, I'll add that rule and retry these that it missed. -- GreenC 20:52, 10 September 2025 (UTC)
- To be fair, the URL didn't have a slash at the time. In any case, I didn't find a lot of the other sample links. MrLinkinPark333 (talk) 21:49, 10 September 2025 (UTC)
- I see a problem. In Falling Down (Selena Gomez & the Scene song) [2]. Note the trailing query string "?page=0%2C1" which apparently is causing the redirect to fail. Removed and it works. I've never seen the query cause a redirect fail. Some variation of this query is in 66 pages. I just added a rule to remove it, reprocessed those 66, and only 2 URLs were fixed: the same two. Strange. -- GreenC 02:49, 11 September 2025 (UTC)
- Found that removing that string at Gabe McDonough makes a working redirect. Since it's a bare external link, I'm guessing the bot doesn't pick it up? MrLinkinPark333 (talk) 18:45, 11 September 2025 (UTC)
- That page was not in the list because the URL starts with billboard.com/biz/articles of which there are 1,800 -- GreenC 00:59, 12 September 2025 (UTC)
- Ah. I misread the url. I was going to request billboard.biz in the future. I'll make a request soon as billboard ones have been fixed. MrLinkinPark333 (talk) 05:41, 12 September 2025 (UTC)
- Ok. Billboard is like the BBC a monster domain. -- GreenC 14:44, 12 September 2025 (UTC)
- That's why I'm focusing on only parts of them :) MrLinkinPark333 (talk) 15:03, 12 September 2025 (UTC)
Enwiki
- Batch 1 (00001-05000): Checked 5,000 pages and edited 4,983 pages. Moved 10,009 links to a new URL: 6,381 normal redirects, 3,625 ruled mapped redirects, 3 ghost mapped redirects. Resolved 30 soft-404s. Removed 1 {{dead link}}. Added 16 {{dead link}}. Switched 291 |url-status=dead to live. Switched 54 |url-status=live to dead. Added 106 archive URLs (104 Wayback).
- Batch 2 (05001-15099): Checked 10,100 pages and edited 10,089 pages. Moved 21,436 links to a new URL: 14,386 normal redirects, 7,034 ruled mapped redirects, 16 ghost mapped redirects. Resolved 52 soft-404s. Removed 1 {{dead link}}. Added 26 {{dead link}}. Switched 502 |url-status=dead to live. Switched 111 |url-status=live to dead. Added 229 archive URLs (224 Wayback).
Done -- GreenC 14:44, 12 September 2025 (UTC)
independent.ie
The following domains redirect to various regional pages at independent.ie:
- argus.ie
- corkman.ie
- drogheda-independent.ie
- fingal-independent.ie
- herald.ie
- kerryman.ie
- newrossstandard.ie
- sligochampion.ie
- wexfordpeople.ie
- wicklowpeople.ie
Some examples:
- On Probation of Offenders Act 1907, [3] redirects here, the original content is at [4].
- Sometimes the numbers at the end change. On Great Island Power Station, [5] redirects here, the original content is at [6].
- Sometimes the path segments are kept. On Kevin O'Connor (footballer, born 1995), [7] redirects here, the original content is at [8].
- On Rockchapel, [9] redirects here, the original content is at [10].
There's probably more patterns, I only checked like 20 links. This is my first time here, sorry for any formatting issues. ClumsyOwlet (talk) 18:51, 5 September 2025 (UTC)
- ClumsyOwlet: I would normally say this is impossible. There is no way to map this to that. However there is logic: given the last field "probation-act-for-ducie-after-donation-made" is common to both URLs, make a Google search of the site independent.ie for this common string. It correctly returns a match for the new URL. However, I can't automate Google searches without being blocked. But I can run Google Gemini (AI) queries, and ask it to run Google searches. This loophole works. It's not free, but Google seems OK with it so long as there is payment involved. Who's paying? My boss, The Internet Archive. I did some cost analysis: if I run the query 2,000 times it will cost $3.67 US total. I think we can afford it; repairing all these URLs is cheap. This would be a new AI approach never done before. -- GreenC 05:08, 10 September 2025 (UTC)
- AI is not working it is hallucinating too much. I came up with a different solution using "Ruled mapped inferred redirects" (last section) - basically it searches the WaybackMachine index for the common string. It misses some because the URLs are not in the WaybackMachine. I am out of tricks to find those, they will be converted to archive URLs. -- GreenC 20:50, 10 September 2025 (UTC)
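The common-string matching described above can be sketched roughly as follows. This is only an illustration, not the bot's actual code: `last_slug`, `find_by_slug`, and the example URLs are hypothetical, and the `candidates` list stands in for whatever index (for instance a Wayback Machine URL listing) is really searched.

```python
from urllib.parse import urlsplit

def last_slug(url):
    """Return the last path segment that is not a purely numeric ID."""
    segments = [s for s in urlsplit(url).path.split("/") if s]
    for seg in reversed(segments):
        seg = seg.rsplit(".", 1)[0]  # drop an extension such as ".html"
        if not seg.isdigit():
            return seg
    return ""

def find_by_slug(old_url, candidates):
    """Return the single candidate whose path contains the old URL's slug.

    Returns None when there is no match or when the match is ambiguous,
    so the caller can fall back to an archive URL.
    """
    slug = last_slug(old_url)
    if not slug:
        return None
    matches = [c for c in candidates if slug in urlsplit(c).path]
    return matches[0] if len(matches) == 1 else None

# Hypothetical URLs; only the slug comes from the discussion above.
old = "http://old.example/news/probation-act-for-ducie-after-donation-made"
index = [
    "https://new.example/regionals/probation-act-for-ducie-after-donation-made/123.html",
    "https://new.example/regionals/another-story/456.html",
]
print(find_by_slug(old, index))
```

Returning nothing on an ambiguous match is a deliberate choice in this sketch: a wrong move is worse than falling back to an archive link.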
Enwiki
- Batch 1 (0001-0200): Checked 200 pages and edited 188 pages. Moved 161 links to a new URL: 161 ruled mapped inferred redirects. Switched 31 |url-status=dead to live. Switched 7 |url-status=live to dead. Added 104 archive URLs (73 Wayback).
- Batch 2 (0201-1508): Checked 1,308 pages and edited 1,227 pages. Moved 988 links to a new URL: 988 ruled mapped inferred redirects. Resolved 1 soft-404s. Added 2 {{dead link}}. Switched 192 |url-status=dead to live. Switched 75 |url-status=live to dead. Added 701 archive URLs (518 Wayback).
Done -- GreenC 01:51, 11 September 2025 (UTC)
bbc.co.uk misc
Thank you for finding so many URL replacements for bbc.co.uk. There are 11k left, but not all of them will need fixing:
- URLs that end in .shtml tend to be working, with no changes needed. These pages will primarily say that the BBC archived the page. However, I found a broken link at Chordate. ~5k
- URLs that are not sport, news, or shtml tend to be working or redirect like this one. ~6k
The main things I see are either changing HTTP to HTTPS or archive fixes. As some of these links already have archived links in the article, this should hopefully be resolved quickly. Thank you again! MrLinkinPark333 (talk) 22:22, 5 September 2025 (UTC)
- Using a different method of searching (SQL query), for #1 it returns over 29,000 pages, and for #2 is 134,000 pages. I think CirrusSearch can't accurately search in this case because if there is /news anywhere in the page it will not be reported. For example a page has two URLs - one with /news and the other not - it will skip the entire page since it contains /news. SQL shows every URL, you can filter and see which pages contain a URL pattern. -- GreenC 15:26, 12 September 2025 (UTC)
- Does it work better if you search with the website name, like this? It's giving me a lot more than I thought. MrLinkinPark333 (talk) 16:09, 12 September 2025 (UTC)
- Better, but same issue: -insource:"bbc.co.uk/news/" means if this string appears anywhere on the page, don't list the page, even though there might be other URLs on the page that should be included. According to SQL, the number of pages containing a BBC url is about 150,000. There might be some /news or /sport in that 150,000 but those pages also contain other BBC links. It excludes pages that only contain /news or /sport. Since there is no real difference in how the URLs are processed, I suggest we consider the 150k as the primary set, then break down into smaller batches. It could be a lot of batches. If it runs as well as last time, very large batches are possible then it won't be many. I can start slow with small batches to see what problems come up. -- GreenC 16:41, 12 September 2025 (UTC)
- Hmm. I'm not sure which batch to focus on. A lot of them look to be working with no issues. I've found some with various issues:
- Maybe you misunderstood what I wrote. There is no sense separating based on URL path, because every BBC URL needs to be processed. /teach, /sound/ etc.. all of them need to be checked: *.bbc.co.uk/* -- GreenC 20:11, 12 September 2025 (UTC)
- Ah okay. I didn't want you to waste your time. MrLinkinPark333 (talk) 21:05, 12 September 2025 (UTC)
- It's alright. So far over 80% of the pages have a change. The results are similar in quality but more in quantity than /news and /sport. The work is on the computer. It's actually more work to do separate projects because it requires creating a new project, updating configurations, and downloading a list of target articles. By keeping it under the same project I only need to start a new batch ("Batch 1", "Batch 2" etc), which is fairly easy. If the projects require different configurations they need to be separate, but for this project it's all looking the same. -- GreenC 00:02, 13 September 2025 (UTC)
Rolled into whole set
/teach/
~60 redirects
/sounds/
For some reason, the links work, then redirect to a broken page. ~1000.
/dna/
/cult/
Mixture of working and broken. ~700.
If you could extract a list of sections to go through, that'd be great. I'd only need the section names, not the URLs. I don't think all of them will need checking. I can then check, and post batches in later requests. I'll just leave the 4 above here, so you can work on other requests. --MrLinkinPark333 (talk) 18:32, 12 September 2025 (UTC)
- @GreenC: This seems to be changing lots of news.bbc.co.uk links to https when the https actually redirects back to http. Examples: [11] [12] [13] [14] In addition it's changing between news.bbc.co.uk/1/* and news.bbc.co.uk/2/* which seem to randomly redirect to each other. According to meta tags the former is the UKFS_URL and the latter is the IFS_URL which apparently stand for "UK facing site" and "international facing site", but something is seemingly misconfigured as they now redirect randomly from the same IP. EvenTwist41 (talk) 02:17, 17 September 2025 (UTC)
- Thanks. It's on hold. I need to think about how to proceed. -- GreenC 05:11, 17 September 2025 (UTC)
- There are two issues isolated to news.bbc.co.uk : A) https redirects to http B) /1/ redirects to /2/ and other way randomly .. there are also two piles of links: X) URLs already modified listed below. Y) URLs yet to be modified.
- I think for A+X, it is best to leave them alone, it causes no harm, and maybe one day they will properly support https anyway. For A-Y, there is no compelling reason to switch to https. For B-X, this is harmless best left alone. For B-Y, same, best left alone not make any more changes.
- End result: do nothing, except add code to skip processing news.bbc.co.uk going forward, at least when the only change is of type A or B. -- GreenC 20:19, 19 September 2025 (UTC)
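A skip rule of the kind described could look something like the sketch below. It is an assumption about how such a check might be written, not the bot's code; `should_skip` and the example URLs are hypothetical. It treats a proposed change as skippable when both URLs are on news.bbc.co.uk and differ only in scheme (type A) and/or the leading /1/ versus /2/ path segment (type B).

```python
import re
from urllib.parse import urlsplit

HOST = "news.bbc.co.uk"

def _normalize(url):
    """Ignore the scheme and collapse a leading /1/ or /2/ to a placeholder."""
    parts = urlsplit(url)
    path = re.sub(r"^/[12]/", "/N/", parts.path)
    return (parts.netloc.lower(), path, parts.query, parts.fragment)

def should_skip(old_url, new_url):
    """True when the only differences are http/https and/or /1/ vs /2/."""
    old_host = urlsplit(old_url).netloc.lower()
    new_host = urlsplit(new_url).netloc.lower()
    if old_host != HOST or new_host != HOST:
        return False
    return old_url != new_url and _normalize(old_url) == _normalize(new_url)

# Hypothetical example URLs.
print(should_skip("http://news.bbc.co.uk/1/hi/uk/123.stm",
                  "https://news.bbc.co.uk/2/hi/uk/123.stm"))
```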
- Just saw this on a page I frequently edit. What did your bot do? It changed it from http to https. What was that for? Are you saying that whoever copied the url is wrong? There was nothing wrong with it in the first place! RandomEditorofWiki (talk) 14:01, 27 September 2025 (UTC)
- Right, discussed immediately above.. -- GreenC 17:49, 27 September 2025 (UTC)
- Yes, so why did it matter? What’s the difference between it being http and https? RandomEditorofWiki (talk) 20:24, 27 September 2025 (UTC)
Enwiki
- Batch 1 (000001-005000): Checked 5,001 pages and edited 4,130 pages. Moved 9,039 links to a new URL: 673 normal redirects, 8,366 ruled mapped redirects. Resolved 8 soft-404s. Removed 2 {{dead link}}. Added 72 {{dead link}}. Switched 134 |url-status=dead to live. Switched 95 |url-status=live to dead. Added 660 archive URLs (592 Wayback).
- Batch 2 (005001-041000): Checked 36,002 pages and edited 29,514 pages. Moved 64,641 links to a new URL: 5,324 normal redirects, 59,317 ruled mapped redirects. Resolved 17 soft-404s. Removed 27 {{dead link}}. Added 637 {{dead link}}. Switched 825 |url-status=dead to live. Switched 440 |url-status=live to dead. Added 5,347 archive URLs (4,619 Wayback).
- Batch 3 (041001-071000): Checked 30,000 pages and edited 24,711 pages. Moved 52,790 links to a new URL: 4,473 normal redirects, 48,317 ruled mapped redirects. Resolved 274 soft-404s. Removed 17 {{dead link}}. Added 476 {{dead link}}. Switched 784 |url-status=dead to live. Switched 364 |url-status=live to dead. Added 4,792 archive URLs (4,148 Wayback).
- Batch 4 (071001-104000): Checked 33,006 pages and edited 27,145 pages. Moved 57,766 links to a new URL: 5,299 normal redirects, 52,467 ruled mapped redirects. Resolved 462 soft-404s. Removed 8 {{dead link}}. Added 520 {{dead link}}. Switched 747 |url-status=dead to live. Switched 513 |url-status=live to dead. Added 6,152 archive URLs (5,731 Wayback).
- Batch 5 (104001-134000):
On hold per above -- GreenC 05:11, 17 September 2025 (UTC)
granitehighworld.com
An HTTP era domain found in Rudolph G. Wilson that is now usurped; I assume it most likely was a high-school newspaper for Granite City, Illinois, but now it seems the type of site that'd go on the spam blacklist, with the Chinese text and the markedly not-high-school-friendly content of the site. Departure– (talk) 20:28, 6 September 2025 (UTC)
- For what it's worth, I don't know if it's cited in any other articles and I'm going to bring the article I found it on to AFD momentarily, but it doesn't hurt to check. Departure– (talk) 20:34, 6 September 2025 (UTC)
- It's only the one article. I did this Special:Diff/1309543267/1309946672 for the record, and this Special:Diff/1309940301/1309946556, that should take care of it. -- GreenC 21:20, 6 September 2025 (UTC)
- p.s. I was able to confirm that Granite High World is on the Granite City High School page as the listed newspaper, so my initial suspicions as to the original source were correct (uncited, but right enough in my book). Departure– (talk) 21:48, 6 September 2025 (UTC)
Done seems like -- GreenC 01:06, 19 September 2025 (UTC)
Billboard biz
These ones are mainly for billboard.biz. I added a related one as well:
- billboard.biz tend to soft 404 redirect to the main page of https://www.billboard.com/pro/ ~3600
- billboard.com/bbbiz/ 7
I found a different domain with bbbiz in the URL, but I'll make it a separate request. Thanks again :) MrLinkinPark333 (talk) 18:46, 12 September 2025 (UTC)
billboard.biz
Enwiki
- Checked 3,202 pages and edited 1,421 pages. Resolved 6,293 soft-404s. Added 1,142 {{dead link}}. Switched 1,031 |url-status=live to dead. Added 1,056 archive URLs (854 Wayback).
IABot DB
- MrLinkinPark333: Apparently I already permadead'd many of the .biz links in May 2021 (example). This time through I found another ~200 archive.today links missed. Possibly I wasn't checking for archive.today in 2021. It's interesting that so many more links in 2025 needed updating: 1,142 {{dead link}}. Switched 1,031 |url-status=live to dead. Added 1,056 archive URLs (854 Wayback). Maybe these links have since died but were active in 2021, maybe my methods in 2021 were inaccurate, or maybe IABot was unable to parse/fix them on-wiki. Anyway, things continue to move in the right direction. -- GreenC 02:19, 21 September 2025 (UTC)
billboard.com/bbbiz/
I'm going to skip these 7 because there are only 7 and they are easily fixed manually. I am falling behind on requests, thanks. -- GreenC 02:21, 21 September 2025 (UTC)
Done -- GreenC 03:38, 22 September 2025 (UTC)
yjc.news
Usurped. Old site of the Young Journalists Club, the new one is yjc.ir.
Examples:
- On Iranian handicrafts, http://www.yjc.news/fa/news/6228037 is now at https://www.yjc.ir/fa/news/6228037/توتن-قایق-یکی-از-صنایع-دستی-سیستان-دریاچه-هامون-چشم-انتظار-حیات-دوباره-آن-است (I just changed .news to .ir in the original link and it turned into the correct link).
- On The Accused Escaped, https://www.yjc.news/fa/news/6397790/%D8%B1%D8%A7%D8%B2-%D8%A7%D8%B5%D8%BA%D8%B1-%D9%81%D8%B1%D9%87%D8%A7%D8%AF%DB%8C-%D9%BE%D8%B3-%D8%A7%D8%B2-%D8%B3%D8%A7%D9%84-%D9%87%D8%A7-%D9%81%D8%A7%D8%B4-%D8%B4%D8%AF-%D9%81%DB%8C%D9%84%D9%85 is now at https://www.yjc.ir/fa/news/6397790/راز-اصغر-فرهادی-پس-از-سال%E2%80%8Cها-فاش-شد-فیلم (Just changed .news to .ir. Also works if you take the "https://www.yjc.news/fa/news/6397790" part of the original link and change .news to .ir.).
- On Heshmatollah Falahatpishe, https://www.yjc.news/en/news/38193 is now at https://www.yjc.ir/en/news/38193/iran-to-claim-compensation-from-us-for-chemical-weapons-victims-mp (Same thing for English).
- On 2022 Hormozgan earthquakes, https://www.yjc.news/fa/amp/news/8162021 is now at https://www.yjc.ir/fa/news/8162021/زلزله-های-پی-در-پی-در-غرب-هرمزگان-زلزله-۵۲-ریشتری-چارک-را-لرزاند-فیلم-و-تصاویر (Removed /amp and changed .news to .ir).
- On List of Esteghlal F.C. managers, https://www.yjc.news/00U4Iq is now at https://www.yjc.ir/fa/news/7166384/وریا-غفوری-سرمربی-موقت-استقلال (URL Shortening. .news to .ir works.)
ClumsyOwlet (talk) 02:53, 13 September 2025 (UTC)
- This will be multi-step. Because the domain name has changed from http://www.yjc.news/fa/news/6228037 --> https://www.yjc.ir/fa/news/6228037 I can do a domain move on existing URLs. It will be configured so any that can't be moved will be set as |url-status=dead and archive URL added (or dead link tag). After that is complete, I will add the old domain to WP:JUDI, so those remaining links in the old domain get the usurpation treatment, as part of a future JUDI batch run. That should cover both moving the domain where possible, and the usurpation where a move was not possible. Unfortunately I can't easily do both at the same time as moving and usurpation are different types of processes. -- GreenC 02:31, 21 September 2025 (UTC)
Enwiki
- Checked 136 pages and edited 135 pages. Moved 153 links to a new URL: 153 ruled mapped redirects. Resolved 71 soft-404s. Removed 1 {{dead link}}. Switched 8 |url-status=dead to live. Added 4 archive URLs (4 Wayback).
IABot DB
- Set domain permadead (IABot does not support URL moves)
Done and updated WP:JUDI -- GreenC 04:14, 21 September 2025 (UTC)
coa.inducks.org
Hi, can you please change all web links with the domain name "coa.inducks.org" to the domain "inducks.org"? There are hundreds if not thousands of them in Wikipedia. Here is an example of what should be done: https://en.wikipedia.org/w/index.php?title=Junior_Woodchucks&diff=1311246207&oldid=1305819590 You can safely change any https URL with domain coa.inducks.org into inducks.org, except in archive.org URLs of course. Lerichard (talk) 08:20, 14 September 2025 (UTC)
- Lerichard: Website reports:
- "Due to a high number of AI bots scrawling our website we've had to take the decision to ask visitors to please log-in or register before browsing this website. We apologize for the inconvenience and hope we will find a better solution in the future."
- Since I don't have a login, it requires a "blind move" ie. switch the URL without verifying. Blind moves are risky, there are usually some links that don't work, but since there are only about 120 pages containing coa links, it is a better option than nothing. If it breaks things let me know I can try to repair. -- GreenC 00:48, 22 September 2025 (UTC)
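A "blind move" in the sense used above could be sketched as a plain prefix swap that leaves archive URLs alone. `blind_move` and the example URLs are hypothetical; nothing here verifies that the target exists, which is exactly the risk mentioned.

```python
import re

# Only https URLs whose host is exactly coa.inducks.org are rewritten.
COA_PREFIX = re.compile(r"^https://coa\.inducks\.org/")

def blind_move(url):
    """Swap coa.inducks.org for inducks.org without checking the result."""
    # Archive snapshot URLs embed the original URL and must stay untouched.
    if "archive.org/" in url:
        return url
    return COA_PREFIX.sub("https://inducks.org/", url, count=1)

print(blind_move("https://coa.inducks.org/story.php?c=EXAMPLE"))
```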
Done -- GreenC 03:38, 22 September 2025 (UTC)
- Great, thanks! Lerichard (talk) 20:19, 22 September 2025 (UTC)
ted.com
The old TED video URL format was "http://www.ted.com/talks/talk_name_here.html", which now returns 404. The current TED video URL format is: "https://www.ted.com/talks/talk_name_here" with the trailing ".html" removed (and HTTPS). A quick search suggests there could be about 1,000 affected links. UnlikelyEvent (talk) 07:02, 15 September 2025 (UTC)
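The rule above can be written down as a small pattern rewrite. This is a sketch only; `move_ted` is a hypothetical name, and accepting the host with or without "www." is an assumption.

```python
import re

# Old format: http(s)://www.ted.com/talks/<talk_name>.html
TED_OLD = re.compile(r"^https?://(?:www\.)?ted\.com/talks/([^/?#]+)\.html$")

def move_ted(url):
    """Return the new-format URL, or None if the URL is not in the old format."""
    m = TED_OLD.match(url)
    if not m:
        return None
    return f"https://www.ted.com/talks/{m.group(1)}"

print(move_ted("http://www.ted.com/talks/talk_name_here.html"))
```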
Enwiki
- Checked 3,195 pages and edited 1,449 pages. Moved 1,786 links to a new URL: 270 normal redirects, 1,482 ruled mapped redirects, 34 ghost mapped redirects. Resolved 43 soft-404s. Added 15 {{dead link}}. Switched 75 |url-status=dead to live. Switched 3 |url-status=live to dead. Added 72 archive URLs (70 Wayback).
Done -- GreenC 03:37, 22 September 2025 (UTC)
whitehousemuseum.org
Looks like the domain expired and the site moved to tysto.com according to this blog post.
http://www.whitehousemuseum.org/Something → http://www.tysto.com/Something -- Nintendofan885T&Cs apply 20:52, 15 September 2025 (UTC)
Enwiki
- Checked 69 pages and edited 67 pages. Moved 101 links to a new URL: 101 ruled mapped redirects. Switched 6 |url-status=dead to live.
IABot DB
- Set permadead (IABot does not support URL moves)
Done -- GreenC 01:32, 23 September 2025 (UTC)
artinfo.com
572 pages. This domain was usurped. Cherry Cotton Candy 12:14, 16 September 2025 (UTC)
- ω Awaiting next WP:JUDI batch. -- GreenC 01:41, 23 September 2025 (UTC)
bostonmetroopera.com
A couple of pages. This domain was usurped, and at least 2 pages have links to the currently active malicious site. 2601:19E:8000:A4F0:E9F4:6B28:A0A1:A249 (talk) 17:48, 18 September 2025 (UTC)
- ω Awaiting next WP:JUDI batch -- GreenC 01:43, 23 September 2025 (UTC)
consequenceofsound.net
Website moved to https://consequence.net/ with their old links redirecting. Came across this one with ?new=true that still redirects. I think ?new=true should be removed as it still works without it. ~4800 articles. Thanks! MrLinkinPark333 (talk) 01:10, 21 September 2025 (UTC)
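Dropping the ?new=true parameter while leaving the rest of the URL alone could be done along these lines. `drop_new_param` and the example URL path are hypothetical; the sketch only removes the parameter and does not attempt the domain move itself.

```python
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode

def drop_new_param(url):
    """Remove a new=true query parameter, keeping any other parameters."""
    parts = urlsplit(url)
    kept = [(k, v)
            for k, v in parse_qsl(parts.query, keep_blank_values=True)
            if (k, v) != ("new", "true")]
    return urlunsplit(parts._replace(query=urlencode(kept)))

print(drop_new_param("https://consequence.net/2020/05/example-article/?new=true"))
```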
Enwiki
- Checked 4,789 pages and edited 4,560 pages. Moved 5,250 links to a new URL: 2,769 normal redirects, 2,362 ruled mapped redirects, 119 ghost mapped redirects. Resolved 5 soft-404s. Added 1 {{dead link}}. Switched 140 |url-status=dead to live. Switched 34 |url-status=live to dead. Added 45 archive URLs (43 Wayback).
IABot
- IABot does not have support for URL moves
Done -- GreenC 01:06, 24 September 2025 (UTC)
vnuemedia.com
Some of these are Billboard biz links. Unfortunately, they can't be converted like this to that because of the different number ID. ~290. Thank you! MrLinkinPark333 (talk) 01:22, 21 September 2025 (UTC)
stannenj.com
This used to be a Catholic school, now it's an online gambling blog. Crywalt (talk) 14:02, 25 September 2025 (UTC)
- It's at St._Anne_School_(Fair_Lawn,_New_Jersey) Crywalt (talk) 14:05, 25 September 2025 (UTC)
- Added to WP:JUDI, which is the correct place for this Big Blue Cray(fish) Twins (talk) 07:44, 4 October 2025 (UTC)
- Thanks! Crywalt (talk) 23:42, 4 October 2025 (UTC)
fimi.com charts
Fimi.com, the provider of the Italian official albums and singles charts, recently relaunched its website.
Links to the albums charts archives changed from https://www.fimi.it/top-of-the-music/classifiche.kl#/charts/1/2023/8 to https://www.fimi.it/top-of-the-music/archivio-classifiche-settimanali/archivio-classifiche-per-settimana/?tipo=2&anno=2023&settimana=8.
Links to the singles charts archives changed from https://www.fimi.it/top-of-the-music/classifiche.kl#/charts/3/2023/8 to https://www.fimi.it/top-of-the-music/archivio-classifiche-settimanali/archivio-classifiche-per-settimana/?tipo=2&anno=2023&settimana=8#tabs-1b (the difference with album chart being the suffix #tabs-1b).
~ 1,900 pages. --Cavarrone 07:29, 28 September 2025 (UTC)
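The two mappings above can be expressed as one pattern. This is a sketch under the assumption that the two example pairs generalise to every year and week; `move_fimi` is a hypothetical name, and `tipo=2` is copied as-is from both examples.

```python
import re

OLD = re.compile(
    r"^https://www\.fimi\.it/top-of-the-music/classifiche\.kl"
    r"#/charts/(\d+)/(\d{4})/(\d{1,2})$"
)
NEW_BASE = ("https://www.fimi.it/top-of-the-music/archivio-classifiche-settimanali/"
            "archivio-classifiche-per-settimana/")
# Old chart number -> suffix on the new URL (1 = albums, 3 = singles).
SUFFIX = {"1": "", "3": "#tabs-1b"}

def move_fimi(url):
    """Map an old chart archive URL to the new format, or return None."""
    m = OLD.match(url)
    if not m or m.group(1) not in SUFFIX:
        return None
    chart, year, week = m.groups()
    return f"{NEW_BASE}?tipo=2&anno={year}&settimana={week}{SUFFIX[chart]}"

print(move_fimi("https://www.fimi.it/top-of-the-music/classifiche.kl#/charts/3/2023/8"))
```

Any other chart number returns None rather than a guess, since the request only documents albums and singles.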
Fishes of Australia
Bad: (Hard 404) : https://museumsvictoria.com.au/home/species/3305
Works: https://fishesofaustralia.net.au/home/species/3305
NB: https://collections.museumsvictoria.com.au/ seems to be unaffected, so it'll just be:
https://museumsvictoria.com.au/home/species/# -> https://fishesofaustralia.net.au/home/species/#
I can't seem to find a search to give you a rough idea of how many there are, including various fixes such as archive refs, but I think there'll be a few hundred.
Please could you get your bot to replace these? Thanks Big Blue Cray(fish) Twins (talk) 19:08, 2 October 2025 (UTC)
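The requested replacement might be sketched as below. `move_species` is a hypothetical name, and allowing an optional "www." and plain http on the old host is an assumption. The pattern is anchored on the host so that collections.museumsvictoria.com.au is left alone, as noted above.

```python
import re

OLD = re.compile(r"^https?://(?:www\.)?museumsvictoria\.com\.au/home/species/(\d+)$")

def move_species(url):
    """Return the fishesofaustralia.net.au URL for a species page, else None."""
    m = OLD.match(url)
    if not m:
        return None
    return f"https://fishesofaustralia.net.au/home/species/{m.group(1)}"

print(move_species("https://museumsvictoria.com.au/home/species/3305"))
```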
flycmi.com
Found on University of Illinois Willard Airport and has a big fat 404 with a banner containing "judi" up top. Departure– (talk) 00:37, 3 October 2025 (UTC)
- Added to WP:JUDI, which is the correct place for this Big Blue Cray(fish) Twins (talk) 07:44, 4 October 2025 (UTC)
villagevoice.com
Many of these links need to be converted to new URLs like this. This will have to be in batches because it's not all the same method.
Non PHP links
These parts need to be removed to create the new URLs. The URLs may need more than one of these points removed:
- Dates: These are /YYYY-MM-DD/ or /YYYY/MM/DD/.
- Sections: These are usually after the date before the article name. They may be after .com like below. [15]
- Sections with Numerical IDs: These are after the end of the URL. [16]
- Apostrophe: [17] is now [18]
- /number/ at the end like this and that
Alternatively:
- Any URL missing an ending slash needs one, like Gaga's link above.
- Commas: These links will most likely need ghost redirects as this is now here.
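Two of the points above (removing the date and adding an ending slash) can be sketched as follows. This deliberately covers only those two rules; sections, numerical IDs, apostrophes, and commas are not handled, and the example URL is hypothetical. `candidate` is an assumed helper name, and its output would still need to be checked against the live site.

```python
import re
from urllib.parse import urlsplit, urlunsplit

# A date path segment: /YYYY-MM-DD or /YYYY/MM/DD, followed by another segment.
DATE = re.compile(r"/\d{4}[-/]\d{2}[-/]\d{2}(?=/)")

def candidate(url):
    """Strip one date segment from the path and ensure an ending slash."""
    parts = urlsplit(url)
    path = DATE.sub("", parts.path, count=1)
    if not path.endswith("/"):
        path += "/"
    return urlunsplit(parts._replace(path=path))

print(candidate("https://www.villagevoice.com/2010-03-09/music/example-article"))
```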
PHP links
The majority of the links will redirect. They follow the same rules above with some exceptions.
- Underscores: These need to change to hyphens while removing the majority of the URL: Converting to that redirects to here
- /issues/: Although this is now here, no luck in converting.
- /specials/: No luck converting articles like this which is already archived.
If you want to go through the entire website, including the working links, ~7k Thank you very much! MrLinkinPark333 (talk) 23:56, 3 October 2025 (UTC)
nyti.ms
972 pages. Expand web short URL for nytimes.com -- GreenC 18:30, 5 October 2025 (UTC)