Web
From Just Solve the File Format Problem
				
								
Revision as of 02:34, 17 December 2014
This category covers formats connected with the World Wide Web, though most of them overlap with other categories. Essentially anything that can be stored in a file format of any sort can be put on the Web, and many types of documents, graphics, audio, video, markup, programming languages, and more are used there. The Web is a variety of HyperMedia, and by far the most successful one.
Blogging and web hosting platforms
Content management systems
Development software
Feeds, syndication, and metadata
- Atom (syndication format)
- hAtom (Microformats)
- hListing (Microformats; product/service listings)
- hMedia (Microformats; image/video/audio metadata)
- hNews (Microformats; news articles)
- hResume (Microformats; resumes/CVs)
- hReview (Microformats; ratings/reviews)
- RDF
  - RDFa (linked data in HTML)
- RSS
Markup, documents and data
- BBCode
- Cascading Style Sheets (CSS)
  - Sass (pre-processor for CSS)
- Compressed Markup Language (CML, PQA; used in PalmOS)
- HTML/XHTML
- Markdown
- MHTML
- Wiki markup
- WML
- WOFF
Site dumps and offline reading
Miscellaneous
- Canonical Link Relation (rel="canonical")
- Favicon
- Mark of the Web
- Open Graph protocol
- P3P (Platform for Privacy Preferences)
- rel-author (Microformats; link to author homepage)
- rel-home (Microformats; link to site homepage)
- rel-license (Microformats; link to site license)
- rel-nofollow (Microformats; gets robots to ignore link)
- rel-payment (Microformats; link to payment method)
- rel-tag (Microformats)
- Robots Exclusion Standard (robots.txt)
- Sitemap
- URL shorteners
Program/system-specific files (browser/server/OS/etc.)
General
- Web browser files (bookmarks, cookies, configurations, etc.)
- Web server files (server configuration, etc.)
Specific
- Internet Shortcut (Windows)
- webarchive (HTML packaging format used by Apple Safari)
- Webloc (Mac OS X)
Protocols and parameters
- Common Gateway Interface (CGI)
- DNS
- Domain name
- Gopher
- HTTP
- HTTPS
- IP address
- IPv6
- JSON API
- MIME type
- SOAP
- TCP/IP
- Tor
- URLs (and URIs, URNs, etc.)
- WAP
Scripts/Applets/Plug-Ins/Frameworks
- .NET Framework
- ActiveX
- JavaScript / ECMAScript
- Microsoft Silverlight
- Open Web App Manifest (.webapp)
- VBScript
- WMLScript
See also
See also E-Mail, newsgroups, and forums, which covers a number of web-based messaging and social-networking services.
External links
History
- Arthur C. Clarke predicts computers and the Internet in 1974
- Tim Berners-Lee discusses Web protocols/formats in Jan 1992
- What the Internet looked like in 1995
- 1996 Internet Step-By-Step Guide video
- 1997 "Kids' Guide to the Internet" video
- 1998: prognosticator predicts "By 2005 or so, it will become clear that the Internet's impact on the economy has been no greater than the fax machine's"
- First posts on famous websites
- Internet Archaeology: Behold the Most Hilarious Abandoned Websites
Privacy and security
- Paranoid Browsing plug-in
- Justdelete.me: Delete your account on lots of web services
- Throw off the spooks by disguising your web traffic
- Disconnect: visualize and block invisible websites that track you
- How's My SSL? (tests the security of your browser)
- Check URL for stuff Websense thinks is malicious or might be subject to filtering for some reason
- Brandis Reassures Australians: Data Retention Laws Only For People Using World Wide Web
Commentary
- How to search the Web like the NSA
- If I could, I would repeal the Internet
- Meet the Hackers Who Want to Jailbreak the Internet
- W3C statement about adding DRM/content protection to web standards
- What Bruce Sterling Actually Said About Web 2.0 at Webstock 09
- To Wash It All Away
- What is still on the web after 10 years of archiving?
Tools and utilities
- Web archiving tools and software
- Three Tools for the Web-Savvy Historian: Memento, Zotero, and WebCite
- Archival Acid Test (tests whether web archivers preserve functionality)
- warcbase: A web archive browser built on HBase
- Ad Detector: flags sponsored content
- Sketchy: takes screenshots and scrapes text from web pages
- Using ImagePlot to Explore Web Archived Images
- BrowserStack screenshot automation
- Memento "time travel" extension for Chrome
  - BibSonomy and Memento
  - RFC 7089 - proposed HTTP protocol extension for date-time negotiation
- The Web, Annotated
- WebRecorder: create archival copies of sites you browse
- Carbon dating the Web
- Web Search by the people, for the people
Other
- Mozilla's Web Literacy Standard
- Chasing the Cicada: Exploring the Darkest Corridors of the Internet
- The average lifespan of a webpage (2011)
- A longitudinal study of Web pages continued: a consideration of document persistence (2004)
- The Conservatives' website purge has destroyed history
- Front end of gov.uk site, downloadable from GitHub
- Downworthy: A browser plugin to turn hyperbolic viral headlines into what they really mean
- Flex the power of cURL for web dev testing and data transfers
- mcurl: Command line memento client to retrieve old versions of web pages from archives
- Surfing modern web with ancient browsers
- Web at 25
- Restoring a 14 year old website
- Internet Archive 404 Not Found Handler
- Web archiving policies of various archives and national libraries
- Information Search in Web Archives (academic thesis)


