The Wiert Corner – irregular stream of stuff

Jeroen W. Pluimers on .NET, C#, Delphi, databases, and personal interests

  • My badges

  • Twitter Updates

  • My Flickr Stream

  • Pages

  • All categories

  • Enter your email address to subscribe to this blog and receive notifications of new posts by email.

    Join 1,676 other followers

Archive for the ‘WayBack machine’ Category

When archiving in the WayBack machine returns error 400: clear your cookies

Posted by jpluimers on 2019/08/16

When archiving pages in the WayBack machine, despite Privacy Badger having set to “save no cookies”, it still managed to set truckloads of cookies.

So I used the Chrome settings in chrome://settings/content/cookies to disable cookies and now everything is fine.

–jeroen

Read the rest of this entry »

Posted in Chrome, Google, Internet, InternetArchive, Power User, Privacy, WayBack machine | Leave a Comment »

When saving on the WayBack machine at web.archive.org/save terminates the connection

Posted by jpluimers on 2019/05/27

When you get the response “web.archive.org unexpectedly closed the connection” without even returning an HTTP code, but:

  • it works in anonymous mode
  • it works with all extensions turned off

then likely there are too many cookies for archive.org or/and web.archive.org: in my case, I had 90 cookies.

Cleaning these cookies out resolved the problem (I used awesome cookie manager for this).

–jeroen

Posted in Chrome, Google, Internet, InternetArchive, Power User, WayBack machine | Leave a Comment »

When +Google Nederland maps only fills one or part of the map tiles…

Posted by jpluimers on 2019/04/18

I still have to do this every few weeks on all my desktop machines: [WayBack] When +Google Nederland maps only fills none or part of the map tiles… – Jeroen Wiert Pluimers – Google+

When +Google Nederland maps only fills none or part of the map tiles at https://maps.google.nl, but https://maps.google.com works fine, then remove any gsScrollPos cookies from www.google.nl.

I need to do this every couple of days to keep maps.google.nl working.

Later I also found it can happen for YouTube, then did more digging for gsScrollPos and found a better workaround: [WayBackAwesome Cookie Manager where you can just delete the gsScrollPos cookies from all sites in one go.

Even later I found out that this can be one of the causes for the WayBack machine giving an error 400 when archiving. A more common reason however is that many archived web-pages try to create cookies in the web.archive.com subdomain resulting in the same problem.

The cause seems to be the Great Suspender plugin which should be fixed by now, but might not automatically update to the latest version. See:

Pending a new Great Suspender release, below is a quick way to manually remove them if you are into SQL scripting for sqlite. It basically comes down to executing the below statement when Chrome is closed:

delete from cookies where name like 'gsScrollPos-%'

–jeroen

Posted in Chrome, Google, GoogleMaps, Internet, InternetArchive, Power User, WayBack machine | Leave a Comment »

GitHub – ArchiveTeam/googleplus-grab: Archiving Google+.

Posted by jpluimers on 2019/03/18

Soon this is a thing of the past, but for just a few more days, you can help: Archiving Google+.

Either run this project: [WayBack] GitHub – ArchiveTeam/googleplus-grab: Archiving Google+.

Or even better: run the appliance, and help the WayBack machine with any archiving projects setup by the virtual appliance: the [WayBack] ArchiveTeam Warrior – Archiveteam.

See some of their other pages for more background information:

You can donate both to the archive team, and the internet archive:

How is G+ archiving doing?

The tracker is well under way: [WayBack] Googleplus tracker Dashboard. History: archive.is 1; archive.is 2

Viewing is simple, but what about saving?

[WayBack] If You See Something, Save Something – 6 Ways to Save Pages In the Wayback Machine | Internet Archive Blogs

The WayBack machine has a few APIs (see[WayBack] Wayback Machine APIs | Internet Archive), and includes support for [WayBack] JSONP – Wikipedia.

Example URLs for the most simple API:

 

Removing content

If you want to remove your own content from the WayBack machine, you can: [WayBack] How do I remove an item page from the site? – Internet Archive Help Center

More

[WayBack] Saving of public Google+ content at the Internet Archive’s Wayback Machine by the Archive Team has begun : plexodus

–jeroen

via: [WayBack] The archiving of public Google+ content to the Internet Archive by the Archive Team has has begun. What does this mean, how does this affect you, and w… – Jeroen Wiert Pluimers – Google+

Posted in Development, G+: GooglePlus, Google, InternetArchive, Power User, Python, Scripting, SocialMedia, Software Development, WayBack machine | Leave a Comment »

The [WayBack] and [Archive.is] links in my blog and G+ stream

Posted by jpluimers on 2018/12/13

Answering a good question on [WayBack] Jeroen Wiert Pluimers – Google+:

Q:

Kevin Powick: Why do so many of your posts have [WayBack] embedded in the text? I’ve never been able to figure it out.

A:

On the blog they point to WayBack URLs of the link that succeeds it because I found out that web pages nowadays last a lot shorter than they used to be so link-rot has increased big time.

I still need to write a bookmarklet that automates that process for me: verify if a page is in WayBack, if not: try to store it, if it fails, store it in Archive.is, then generate the full HTML for me to put in the blog.

–jeroen

 

Posted in Blogging, Power User, SocialMedia, WayBack machine | Leave a Comment »

 
%d bloggers like this: