archive.org tools and stuff

http://machawk1.github.io/wail/

upload a site to archive.org

http://archiveteam.org/index.php?title=Internet_Archive#Uploading_to_archive.org

1.crawl save each webpage seperately

 wget http://web.archive.org/save/

2.https://github.com/n0tan3rd/wail

make warc file and upload to archive.org

http://archiveteam.org/index.php?title=Frequently_Asked_Questions

I uploaded a WARC file but why doesn’t it show up in Wayback Machine?

To ensure content integrity, items with WARC files must have the mediatype set to “web” and be under the Archive Team collection in order for it to be ingested by the Wayback Machine.

 

locally browsing with warc file –

https://github.com/webrecorder/webrecorderplayer-electron

 

Tools 

https://webrecorder.io/

http://rhizome.org/software/

https://ws-dl.cs.odu.edu/Main/Software

http://archive.org/help/abouts3.txt – s3 api key -> http://archive.org/account/s3.php

 

download a site/page from archive.org for given time

modify and use this script

 

https://gareth.halfacree.co.uk/2013/04/bulk-downloading-collections-from-archive-org

 

Advertisements

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

w

Connecting to %s