Web
From Just Solve the File Format Problem
(Difference between revisions)
Dan Tobias (Talk | contribs) (→Markup, documents and data) |
Dan Tobias (Talk | contribs) (→Site dumps, multi-file packaging, and offline reading) |
||
Line 101: | Line 101: | ||
* [[MAFF]] (Mozilla Archive Format) | * [[MAFF]] (Mozilla Archive Format) | ||
* [[Package (Web)]] | * [[Package (Web)]] | ||
+ | * [[Portable Web Publications]] | ||
* [[WARC]] | * [[WARC]] | ||
* [[Webarchive (Safari)]] | * [[Webarchive (Safari)]] |
Revision as of 00:58, 26 October 2015
Formats connected with the World Wide Web, though most of them overlap into other categories; basically everything that can be put in a file format of any sort can be put on the Web, and a multiplicity of types of documents, graphics, audio, video, markup, programming languages, and more, are used there. The Web is a variety of HyperMedia, by far the most successful one.
Blogging and web hosting platforms
Content management systems
Development software
Feeds, syndication, and metadata
- Atom (syndication format)
- hAtom (Microformats)
- hListing (Microformats; product/service listings)
- hMedia (Microformats; image/video/audio metadata)
- hNews (Microformats; news articles)
- hResume (Microformats; resumes/CVs)
- hReview (Microformats; ratings/reviews)
- RDF
- RDFa (linked data in HTML)
- RSS
Markup, documents and data
- Accelerated Mobile Pages (AMP)
- BBCode
- Cascading Style Sheets (CSS)
- Sass (pre-processor for CSS)
- Compressed Markup Language (CML, PQA; used in PalmOS)
- HTML/XHTML
- Markdeep
- Markdown
- MHTML
- Wiki markup
- WML
- WOFF
Program/system-specific files (browser/server/OS/etc)
General
- Web browser files (bookmarks, cookies, configurations, etc.)
- Web server files (server configuration, etc.)
Specific
- Internet Shortcut (Windows)
- webarchive (HTML packaging format used by Apple Safari)
- Webloc (Mac OS X)
Protocols and parameters
- Common Gateway Interface (CGI)
- DNS
- Domain name
- Gopher
- HTTP
- HTTPS
- IP address
- IPV6
- JSON API
- Linked Data Platform
- Mime-type
- QUIC
- SOAP
- TCP/IP
- Tor
- URLs (and URIs, URNs, etc.)
- WAP
Scripts/Applets/Plug-Ins/Frameworks/APIs
- .Net Framework
- ActiveX
- Audio Data API (Mozilla)
- JavaScript / ECMAScript
- MediaStream Processing API
- Microsoft Silverlight
- Open Web App Manifest (.webapp)
- VBScript
- Web Audio API
- WMLScript
Site dumps, multi-file packaging, and offline reading
- HAR (HTTP Archive)
- MAFF (Mozilla Archive Format)
- Package (Web)
- Portable Web Publications
- WARC
- Webarchive (Safari)
- Zeno
- ZIM
Miscellaneous
- Adobe Cross Domain Policy File (crossdomain.xml)
- Canonical Link Relation (rel="canonical")
- Content Security Policy
- Favicon
- Form URL encoding
- Galen (framework for testing responsive sites)
- JSON-LD (JSON for linked data)
- Mark of the Web
- Multipart/Form-Data
- Open Graph protocol
- P3P (Platform for Privacy Preferences)
- Percent-encoding
- rel-author (Microformats; link to author homepage)
- rel-home (Microformats; link to site homepage)
- rel-license (Microformats; link to site license)
- rel-nofollow (Microformats; gets robots to ignore link)
- rel-payment (Microformats; link to payment method)
- rel-tag (Microformats)
- Robots Exclusion Standard (robots.txt)
- Sitemap
- URL shorteners
See also
See also E-Mail, newsgroups, and forums (a number of web-based messaging/social-networking things are there)
External links
History
- Arthur C. Clarke predicts computers and the Internet in 1974
- Tim Berners-Lee discusses Web protocols/formats in Jan 1992
- What the Internet looked like in 1995
- 1996 Internet Step-By-Step Guide video
- 1997 "Kids' Guide to the Internet" video
- 1998: prognosticator predicts "By 2005 or so, it will become clear that the Internet's impact on the economy has been no greater than the fax machine's"
- First posts on famous websites
- Internet Archaeology: Behold the Most Hilarious Abandoned Websites
Privacy and security
- Paranoid Browsing plug-in
- Justdelete.me: Delete your account on lots of web services
- Throw off the spooks by disguising your web traffic
- Disconnect: visualize and block invisible websites that track you
- How's My SSL? (tests the security of your browser)
- Check URL for stuff Websense thinks is malicious or might be subject to filtering for some reason
- Brandis Reassures Australians: Data Retention Laws Only For People Using World Wide Web
Commentary
- How to search the Web like the NSA
- If I could, I would repeal the Internet
- Meet the Hackers Who Want to Jailbreak the Internet
- W3C statement about adding DRM/content protection to web standards
- What Bruce Sterling Actually Said About Web 2.0 at Webstock 09
- To Wash It All Away
- What is still on the web after 10 years of archiving?
Tools and utilities
- Web archiving tools and software
- Three Tools for the Web-Savvy Historian: Memento, Zotero, and WebCite
- Archival Acid Test (tests whether web archivers preserve functionality)
- warcbase: A web archive browser built on HBase
- Ad Detector: flags sponsored content
- Sketchy: takes screenshots and scrapes text from web pages
- Using ImagePlot to Explore Web Archived Images
- BrowserStack screenshot automation
- Memento "time travel" extension for Chrome
- BibSonomy and Memento
- RFC 7089 - proposed HTTP protocol extension for date-time negotiation
- Time travel API documentation
- The Web, Annotated
- WebRecorder: create archival copies of sites you browse
- Carbon dating the Web
- Web Search by the people, for the people
- Builtwith: shows what tools are behind a site
- Ping me when a standard is supported by 90% of browsers
- Resources to Search the Invisible Web
- Download virtual machines running various versions of MSIE
- CodePen: in-browser HTML/CSS/JS editor with instant results
- Netcapsule: Browse old websites in emulated old browsers
- Deslide: Display those web slideshows on one page
Other
- Mozilla's Web Literacy Standard
- Chasing the Cicada: Exploring the Darkest Corridors of the Internet
- The average lifespan of a webpage (2011)
- A longitudinal study of Web pages continued: a consideration of document persistence (2004)
- The Conservatives' website purge has destroyed history
- Front end of gov.uk site, downloadable from Github
- Downworthy: A browser plugin to turn hyperbolic viral headlines into what they really mean
- Flex the power of cURL for web dev testing and data transfers
- mcurl: Command line memento client to retrieve old versions of web pages from archives
- Surfing modern web with ancient browsers
- Web at 25
- Restoring a 14 year old website
- Internet Archive 404 Not Found Handler
- Web archiving policies of various archives and national libraries
- Information Search in Web Archives (academic thesis)
- Bookmarklets are dead... we just don't know it yet