The Wiert Corner – irregular stream of stuff

Jeroen W. Pluimers on .NET, C#, Delphi, databases, and personal interests


Archive for the ‘archive.is / archive.today’ Category

Archive.is blog: Twitter archival is slow, so limit the number of tweets you save in it

Posted by jpluimers on 2022/03/04

[Wayback] Archive.is blog — The website has been slow for some time when…

The website has been slow for some time when archiving Twitter pages, but works fine with other websites. Is there a reason for that? Thx!

Anonymous

1. There are too many pages from Twitter in the queue, which reduces their priority (if it wasn’t for this condition, it would slow everything down)

2. Twitter API sometimes responds with “429 Too Many Requests” or other error, so it usually takes more than 1 attempt to capture the page.

I would suggest refraining from saving pages from Twitter for now, especially those people trying to save dozens or hundreds of tweets
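Point 2 above (a “429 Too Many Requests” response needing more than one attempt) is the usual retry-with-backoff situation. A minimal sketch in Python, not how archive.is actually does it: `fetch_with_retry`, its parameters and the `(status, body)` shape returned by `fetch` are all hypothetical names picked for illustration.

```python
import time
from typing import Callable, Tuple


def fetch_with_retry(
    fetch: Callable[[], Tuple[int, str]],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[int, str, int]:
    """Call fetch() until it returns a non-429 status or attempts run out.

    fetch is any callable returning (http_status, body); this shape is an
    assumption made for the sketch. Returns (status, body, attempts_used).
    """
    status, body = 0, ""
    for attempt in range(1, max_attempts + 1):
        status, body = fetch()
        if status != 429:
            # Anything other than "Too Many Requests" ends the loop.
            return status, body, attempt
        if attempt < max_attempts:
            # Exponential backoff: base_delay, 2x, 4x, ... between attempts.
            sleep(base_delay * 2 ** (attempt - 1))
    # Still rate limited after the last attempt: hand back the final 429.
    return status, body, max_attempts
```

The `sleep` parameter is only there so the waiting can be replaced in a test; in normal use you would pass a `fetch` that wraps whatever HTTP client you already have and leave the rest at its defaults.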

–jeroen

Posted in archive.is / archive.today, Internet, Power User, SocialMedia, Twitter

Archive.is is more like a thread unroll service than an archival service

Posted by jpluimers on 2022/02/14

An interesting take from a while ago, via [Wayback] Archive.is blog — People often compare various features of…

People often compare various features of archive.is to those of archive.org being mistaken by name similarity (and recently added “save a page” function to archive.org).

This project is different in at least two respects:

  1. We have no goal to save the entire Internet. Only manually submitted pages which may be deleted/altered soon. We are about 100x smaller than archive.org in the storage space (700TB vs. 70PB) and expenses (X,000 $/mo vs. X00,000 $/mo).
  2. The pages are not saved in their network form. Archive.today launches real browsers (not even headless) and tries to load lazy images, unroll folded content, login into accounts if prompted with login form, remove “subscribe our maillist” modals, … So archive.today is not suitable for making notarized or digitally signed snapshots.

It would be more correct to compare it with other thread unrollers.

The RSS feed of blog.archive.today is at blog.archive.today/rss


Posted in archive.is / archive.today, Bookmarklet, Internet, InternetArchive, Power User, Web Browsers

 