After its website was crippled for nearly a month by a cyberattack, the Internet Archive announced on Monday that it had restored one of its most valuable services—the Save Page Now feature that allows users to add copies of webpages to the organization’s digital library.

In a social media post, the Internet Archive said web pages that users had attempted to save since October 9 are beginning to be archived now, although it did not provide an estimate for when the process would be completed. So, if you were worried that all of that election coverage was in danger of disappearing, the Archive says it’s handling the backlog. And if you stopped archiving because it was down, get back to work.

The organization had been operating its collection in read-only mode since October 21 as it steadily worked to restore services.

Founded in 1996, the Internet Archive is a nonprofit based in San Francisco that provides access to historic web pages, digitized books, and a variety of other media that it has uploaded through its partnerships with hundreds of physical libraries and other partners.

Its unparalleled collection currently contains 835 billion web pages, 44 million books and texts, 15 million audio recordings, 10.6 million videos, 4.8 million images, and 1 million software programs.

12 points

A friendly reminder to everyone to check out ArchiveBox if you’re looking for a self-hosted archiving solution. I’ve been using it for a while now and it works great; it can be a little rough around the edges at times, but I think it’s a wonderful tool. It’s allowed me to continue saving pages during the Internet Archive’s outage.

permalink
report
reply
1 point

I like ArchiveBox, but in my experience, it kept on running into issues saving pages, and stopped functioning after it worked the first few times. I really wish there was a more streamlined application that did a similar thing somewhere out there.

I’ve been looking at Linkwarden’s page archiving solution, but it crashes whenever I try importing any large number of links, so that’s a bust too.

permalink
report
parent
reply
1 point

What sort of content are you archiving. I was trying to backup Wikipedia at some point and it was just a nightmare.

permalink
report
parent
reply
51 points

“belongs to the USA, and as we all know, this horrendous and hypocritical government supports the genocide that is being carried out by the terrorist state of ‘Israel.'”

Now that’s a false-flag if I ever saw one. Out of all the sites of arms manufacturers, funding operations and troll-farms that directly support Israel, these creeps attack… the Internet Archive?

Tell me another one.

permalink
report
reply
18 points

100%

permalink
report
parent
reply
27 points

We should treasure it and protect it at all cost. This is is also for historical purpose.

permalink
report
reply
24 points

I don’t know what kind of architecture web.archive.org has, but when it was offline, I thought that we should really have something distributed that would allow people to store and host a copy of all websites that are important for them.

permalink
report
reply
2 points

IPFS seems similar to what you’re looking for.

(See: A copy of Wikipedia on IPFS being censorship-resistant, and globally distributed)

permalink
report
parent
reply
8 points

Doesn’t i2p do something similar to this? I don’t know much about it but I remember reading it and thinking that it’s like bittorrent but no one person has the entire file, or something like that.

permalink
report
parent
reply
5 points

didn’t you mean IPFS? I2P is a mixnet like the Tor network.

permalink
report
parent
reply
7 points

100PB on i2p is a funny idea, but it’s not necessarily a bad one.

permalink
report
parent
reply
4 points

TIL it’s I2P and not L2P

permalink
report
parent
reply
7 points

You can save the wacz files?

permalink
report
parent
reply
25 points

This is a concerted effort to change our collective consent https://www.pbs.org/independentlens/documentaries/recorder-the-marion-stokes-project/

permalink
report
reply

Technology

!technology@lemmy.world

Create post

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


Community stats

  • 17K

    Monthly active users

  • 12K

    Posts

  • 542K

    Comments