this post was submitted on 24 Sep 2024
259 points (98.1% liked)

Technology

58287 readers
7502 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
all 25 comments
sorted by: hot top controversial new old
[–] Cenotaph@mander.xyz 116 points 4 days ago (1 children)
[–] wewbull@feddit.uk 38 points 4 days ago (1 children)

...and everybody was shocked! Absolutely shocked.

[–] General_Effort@lemmy.world 1 points 4 days ago (1 children)

Shocked? You'd think all the people outraged at having their websites scraped would be delighted. That's probably the real reason for this.

[–] subignition@fedia.io 2 points 4 days ago (1 children)

It's not the scraping itself, but the purpose of the scraping, that can be problematic. There are good reasons for public sites to allow scraping.

[–] General_Effort@lemmy.world 1 points 4 days ago

I have the distinct impression that a number of people would object to the purpose of re-hosting their content as part of a commercial service, especially one run by Google.

Anyway, now no one has to worry about Google helping people bypass their robots.txt or IP-blocks or whatever counter-measures they take. And Google doesn't have to worry about being sued. Next stop: The Wayback Machine.

[–] TheReturnOfPEB@reddthat.com 19 points 4 days ago (1 children)

google is just gonna slowly fade away like some bad early sci-fi teleporter schtick

[–] WoahWoah@lemmy.world 10 points 4 days ago

It's a 2-trillion-dollar company, I think news of their coming demise has been exaggerated.

[–] urquell@lemm.ee 3 points 4 days ago (2 children)
[–] frezik@midwest.social 26 points 4 days ago

They used to have a "cache" link on search results. It occasionally came in handy when the original site was down or changed their link or something.

[–] WindyRebel@lemmy.world 5 points 4 days ago

It was a tool to see what Google has cached, to check web pages for changes based on Google’s last access.

It also had a nice habit of bypassing those pop-ups that would prevent scrolling. 😂

[–] bassomitron@lemmy.world 124 points 4 days ago (1 children)

I was super annoyed when they first took away the links. "Pages are more dependably available now," is such a lazy excuse. Storing the cached content probably wasn't even that expensive for them, as it didn't retain anything beyond basic html and text. Their shitty AI-centric web search was likely the main reason for getting rid of it.

[–] _haha_oh_wow_@sh.itjust.works 61 points 4 days ago (1 children)

Google sure does love killing things people love.

[–] Kolanaki@yiffit.net 5 points 4 days ago (1 children)

"Introducing Google Pets"

Noooooooo!!!

[–] Duamerthrax@lemmy.world 2 points 2 days ago

A partnership with Delta.

[–] GooberEar@lemmy.wtf 50 points 4 days ago (1 children)

I definitely miss the cached pages. I found that I was using the feature very frequently. Maybe it's just the relative obscurity of some of my hobbies and interests, but a lot of the information online that shows up in search engines seems to come from old forums. Often times those old forums are no longer around or have migrated to new software (obliterating the old URLs and old posts as well).

[–] EncryptKeeper@lemmy.world 5 points 4 days ago* (last edited 4 days ago) (1 children)

If you’re looking for a replacement, there are a lot of similar apps out there you can host yourself (And therefore can’t be killed) or pay a fee to have hosted for you.

https://linkwarden.app/ Is the one I use.

There’s also:

[–] oldfart@lemm.ee 8 points 4 days ago (1 children)

That's not the same at all. Archivebox would do the trick if it was pre-populated with every page Google Search has in its index.

[–] EncryptKeeper@lemmy.world 0 points 4 days ago

Well it is, it just doesn’t go out and do all the caching for you ahead of time, instead it’s on demand. You are right that as far as pre populated alternatives go, it’s just archive.org now.

[–] RustyNova@lemmy.world 20 points 4 days ago (1 children)

At least they are using the internet archive, which is neat

[–] palordrolap@fedia.io 25 points 4 days ago (1 children)

Google's money is a bit scummy these days, and definitely not something that should be relied upon long term, but I hope Google are making some kind of monetary donation.

[–] lud@lemm.ee 7 points 4 days ago* (last edited 4 days ago)

It's unclear if Google is donating anything (It would honestly surprise me if they didn't) but at least archive.org is happy about this feature and they call it an collaboration: https://blog.archive.org/2024/09/11/new-feature-alert-access-archived-webpages-directly-through-google-search/

[–] OutrageousUmpire@lemmy.world 12 points 4 days ago

What a disgrace. This clown show of a company kills things people love and pushes advertising no one wants.

[–] quant@leminal.space 9 points 3 days ago

Another piece of internet history now gone. Perhaps not deleted, but hidden beneath Goggle's own archives until they degrade away.