Honestly, I find it wild there aren’t more digital archives. It’s really just the wayback machine?
boostedred on
I’ve used The Wayback machine several times for different use cases. I got a lot of value out of it!
jiggrinder on
Now why would they do that ?
banditta82 on
I know the NYT sells access to its back archive, I wonder what % of the remaining 23 do as well. While I have no love for how the AI companies train their models but this reeks of “think of the children”.
EverNeko200 on
What’s the point? AI bots will scrape the content anyways. If not bots, AI companies will just pay users with NYT subscriptions to install extensions that siphon articles from behind paywalls. It would be undetectable and unstoppable.
Maybe they could play the cat and mouse game of obfuscation and passing around PO tokens like YouTube does, but at the end of the day it’s just text on a screen. Way easier to steal than a video.
Blocking Archive.org is a waste of time and energy, because what they’re trying to achieve is futility.
Ok-Comedian-9377 on
It’s me guys. It’s my fault. I’ve been using the way back machine to go to one page in the NYT for a gumbo recipe. Despite memorizing it, I pull it up all the time since it’s got lots of extra info and I like looking at it. Last week, it was gone. No more access. Denied. I did it one too many times. I knew it. So I had to go find a picture of a screen shot I took years ago and then I printed it out and pasted it on the back of a kitchen cupboard door. Sorry I broke the nyt with my gumbo recipe obsession.
AutistcCuttlefish on
The internet Archive should try to find a way to impose access blocks on journalists that work for organizations that forbid archiving their websites.
If you aren’t gonna contribute to the archive you shouldn’t be allowed to freeload off of it for your fiscal benefit.
Rehcraeser on
They would get sued a lot more if there was a history of all their titles/articles. I’ve witnessed it first hand so many times. They make a crazy claim with clickbait, and change it a few days later. Somehow it’s legal to fix it days later, when nobody will see it, and act like they didn’t just manipulate millions of people. They would probably slip up more often if it was all being tracked.
Individual-Result777 on
Internet archive clones should pop up just to cover the news only. thats doable…
ttystikk on
They didn’t want to be caught lying.
[deleted] on
[removed]
Shabooopee on
They don’t want history to see their lie and propaganda
K1TSUNE9 on
They don’t want people remembering what lies they posted about the current administration or anything that reflection them. They want people to forget when they remove it from their site.
Rikudo974 on
they just want to be able to rewrite history without leaving a paper trail. being able to change a headline or delete a failed prediction without anyone calling them out is a dream for corporate news. absolute disgrace for journalism
malakon on
What they could do is make articles scraped by Wayback- not accessible for say 100 days. Then people could not use Wayback for paywall bypass.
MuffinzZ291 on
Hot take; just get rid of AI. The world was so much fucking better without it.
x33storm on
It’s because people are using it for free media. Like media is supposed to be.
But if AI gets the hate, i don’t mind.
synapticrelease on
Seems like the solution is to just create a wayback AI that vacuums up all the need sites because it’s apparently legal to do so.
CaptainBayouBilly on
I hope they employ a proxy to scrape data.
For fucks sake, the Internet Archive is important to humanity.
I wonder if those news sites block openAI or the other thieving LLM scumbags?
roseofjuly on
Oh please, they’re not worried about AI. They just know it’s a way for people to read their content for free and we can’t have that.
Goz_system on
Why does it seem like everyone is against preservation?
vertigonex on
>Control the present, control the past.
>Control the past, control the future.
It wasn’t supposed to be a how-to manual…
supadupanerd on
The Oligarchs that own the news media realized that people were using it to check and verify prior comments or statements and they don’t like being called on their bullshit….
So just STFU you serf and get back to sucking the teat of your chosen news org
huntersam13 on
This makes me think of Winston Smith’s job in 1984.
toasohcah on
Our history is always in danger, a lot of information can just go dark at the hands of American tech. It’d be pretty easy to pump out a bunch of Hollywood block busters portraying the Iran war as a massive success for America on all fronts, completely disregard the genocides occurring in Palestine and the region as conspiracies in the coming decades.
Pump out some textbooks, change the college curriculum or else they suffer funding cuts, etc.
action_turtle on
Of course. Ministry of truth is the only truth
Pedrojunkie on
Lets back up the news to paper… and maybe very small films for long term compact storage…
Suspicious-Yogurt-95 on
It’s the time to turn to independent journalists. How can we trust media owned by oligarchs anyway?
28 Comments
Honestly, I find it wild there aren’t more digital archives. It’s really just the wayback machine?
I’ve used The Wayback machine several times for different use cases. I got a lot of value out of it!
Now why would they do that ?
I know the NYT sells access to its back archive, I wonder what % of the remaining 23 do as well. While I have no love for how the AI companies train their models but this reeks of “think of the children”.
What’s the point? AI bots will scrape the content anyways. If not bots, AI companies will just pay users with NYT subscriptions to install extensions that siphon articles from behind paywalls. It would be undetectable and unstoppable.
Maybe they could play the cat and mouse game of obfuscation and passing around PO tokens like YouTube does, but at the end of the day it’s just text on a screen. Way easier to steal than a video.
Blocking Archive.org is a waste of time and energy, because what they’re trying to achieve is futility.
It’s me guys. It’s my fault. I’ve been using the way back machine to go to one page in the NYT for a gumbo recipe. Despite memorizing it, I pull it up all the time since it’s got lots of extra info and I like looking at it. Last week, it was gone. No more access. Denied. I did it one too many times. I knew it. So I had to go find a picture of a screen shot I took years ago and then I printed it out and pasted it on the back of a kitchen cupboard door. Sorry I broke the nyt with my gumbo recipe obsession.
The internet Archive should try to find a way to impose access blocks on journalists that work for organizations that forbid archiving their websites.
If you aren’t gonna contribute to the archive you shouldn’t be allowed to freeload off of it for your fiscal benefit.
They would get sued a lot more if there was a history of all their titles/articles. I’ve witnessed it first hand so many times. They make a crazy claim with clickbait, and change it a few days later. Somehow it’s legal to fix it days later, when nobody will see it, and act like they didn’t just manipulate millions of people. They would probably slip up more often if it was all being tracked.
Internet archive clones should pop up just to cover the news only. thats doable…
They didn’t want to be caught lying.
[removed]
They don’t want history to see their lie and propaganda
They don’t want people remembering what lies they posted about the current administration or anything that reflection them. They want people to forget when they remove it from their site.
they just want to be able to rewrite history without leaving a paper trail. being able to change a headline or delete a failed prediction without anyone calling them out is a dream for corporate news. absolute disgrace for journalism
What they could do is make articles scraped by Wayback- not accessible for say 100 days. Then people could not use Wayback for paywall bypass.
Hot take; just get rid of AI. The world was so much fucking better without it.
It’s because people are using it for free media. Like media is supposed to be.
But if AI gets the hate, i don’t mind.
Seems like the solution is to just create a wayback AI that vacuums up all the need sites because it’s apparently legal to do so.
I hope they employ a proxy to scrape data.
For fucks sake, the Internet Archive is important to humanity.
I wonder if those news sites block openAI or the other thieving LLM scumbags?
Oh please, they’re not worried about AI. They just know it’s a way for people to read their content for free and we can’t have that.
Why does it seem like everyone is against preservation?
>Control the present, control the past.
>Control the past, control the future.
It wasn’t supposed to be a how-to manual…
The Oligarchs that own the news media realized that people were using it to check and verify prior comments or statements and they don’t like being called on their bullshit….
So just STFU you serf and get back to sucking the teat of your chosen news org
This makes me think of Winston Smith’s job in 1984.
Our history is always in danger, a lot of information can just go dark at the hands of American tech. It’d be pretty easy to pump out a bunch of Hollywood block busters portraying the Iran war as a massive success for America on all fronts, completely disregard the genocides occurring in Palestine and the region as conspiracies in the coming decades.
Pump out some textbooks, change the college curriculum or else they suffer funding cuts, etc.
Of course. Ministry of truth is the only truth
Lets back up the news to paper… and maybe very small films for long term compact storage…
It’s the time to turn to independent journalists. How can we trust media owned by oligarchs anyway?