Twitter

From Just Solve the File Format Problem
(Difference between revisions)
Jump to: navigation, search
(Links and references)
 
(33 intermediate revisions by 2 users not shown)
Line 2: Line 2:
 
|subcat=E-Mail, newsgroups, and forums
 
|subcat=E-Mail, newsgroups, and forums
 
}}
 
}}
 
 
'''Twitter''' is a popular social-networking and messaging service, accessible through the web and mobile device apps, allowing users to write 140-character messages publicly or privately. Often the messages include hyperlinks, which get sent through [[URL shorteners]] (so they might suffer linkrot if the shortening services go away). Some of the conventions of the service are discussed in the article on [[Hashtags, at-signs, retweets, etc.]]
 
'''Twitter''' is a popular social-networking and messaging service, accessible through the web and mobile device apps, allowing users to write 140-character messages publicly or privately. Often the messages include hyperlinks, which get sent through [[URL shorteners]] (so they might suffer linkrot if the shortening services go away). Some of the conventions of the service are discussed in the article on [[Hashtags, at-signs, retweets, etc.]]
  
 
Of interest to archivers is the fact that, as of late 2012, Twitter has [http://thenextweb.com/twitter/2012/12/16/twitter-has-started-rolling-out-the-option-to-download-all-your-tweets/ started rolling out] a feature to permit users to save their entire tweet history as an archive file.
 
Of interest to archivers is the fact that, as of late 2012, Twitter has [http://thenextweb.com/twitter/2012/12/16/twitter-has-started-rolling-out-the-option-to-download-all-your-tweets/ started rolling out] a feature to permit users to save their entire tweet history as an archive file.
 +
 +
The [https://datatransferproject.dev/ Data Transfer Project] is building a project for moving data between services such as this one.
  
 
Twitter is also one of the engulf-and-devour Internet megacorporations now which has swallowed up, digested, and excreted other Internet services, a 2013 example being [[Posterous]].
 
Twitter is also one of the engulf-and-devour Internet megacorporations now which has swallowed up, digested, and excreted other Internet services, a 2013 example being [[Posterous]].
 +
 +
The unrelated system [[Twister]] is an open-source, encrypted, decentralized implementation of a similar concept to Twitter.
  
 
== Downloaded Twitter archive ==
 
== Downloaded Twitter archive ==
Line 38: Line 41:
 
** '''underscore''': Contains another JavaScript file.
 
** '''underscore''': Contains another JavaScript file.
  
== Links and references ==
+
== TwitPic ==
 +
 
 +
TwitPic, a popular hosting service for pictures used on Twitter until it abruptly shut down in 2014, was not affiliated with Twitter, and in fact its shutdown was because Twitter suddenly objected (after years of its operation) to its use of a name resembling Twitter's. People suddenly had to [http://shkspr.mobi/blog/2013/08/exporting-twitpic-images-python/ export their images] from that service to save them from deletion.
 +
 
 +
== Documentation ==
 +
* [https://dev.twitter.com Twitter API info]
 +
* [https://dev.twitter.com/docs/counting-characters How Twitter counts characters] (official Twitter documentation)
 +
* [https://www.quora.com/What-is-Grailbird Discussion of 'Grailbird' JavaScript object used in Twitter archive]
 +
 
 +
== Software ==
 +
* [http://www.pkcarlisle.com/blog.html#tth Python script to create 'Tweet this' link that doesn't require JavaScript]
 +
* [https://github.com/ChimeraCoder/emojibot Emojibot: Twitter bot that automatically translates your tweets into emoji, via the magic of Mechanical Turk]
 +
* [http://social-feed-manager.readthedocs.org/ Social Feed Manager]
 +
* [https://github.com/gwu-libraries/tweet2elasticsearch A utility for loading tweets into elasticsearch (Python)]
 +
* [https://github.com/gwu-libraries/social-feed-harvester Social feed harvester (Python)]
 +
* [https://github.com/edsu/twarc twarc: A command line tool for archiving Twitter JSON]
 +
* [https://blog.buoyant.io/2016/02/18/linkerd-twitter-style-operability-for-microservices/ linkerd: Twitter-style Operability for Microservices]
 +
* [https://github.com/docnow/hydrator Hydrator: Turn Tweet IDs into Twitter JSON from your desktop!]
 +
 
 +
== Other links and references ==
 
* [http://www.twitter.com/ Twitter (official site)]
 
* [http://www.twitter.com/ Twitter (official site)]
 
* [http://en.wikipedia.org/wiki/Twitter Wikipedia article on Twitter]
 
* [http://en.wikipedia.org/wiki/Twitter Wikipedia article on Twitter]
 
* [http://www.nybooks.com/blogs/nyrblog/2013/jan/16/librarians-twitterverse/ Librarians of the Twitterverse]
 
* [http://www.nybooks.com/blogs/nyrblog/2013/jan/16/librarians-twitterverse/ Librarians of the Twitterverse]
* [https://dev.twitter.com/docs/counting-characters How Twitter counts characters] (official Twitter documentation)
+
* [http://qz.com/74937/how-to-become-internet-famous-without-ever-existing/ How to become internet famous for $68]
 +
* [http://twitter.bug.quietbabylon.com/ Science-fictional Twitter bug report]
 +
* [http://www.alanwood.net/unicode/miscellaneous_symbols.html Miscellaneous symbols in Unicode] (useful to copy and paste for tweets)
 +
* [http://boingboing.net/2013/10/03/twitter-files-for-ipo.html Twitter files for IPO]
 +
* [http://www.businessinsider.com/chris-messina-talks-about-inventing-the-hashtag-on-twitter-2013-11 The Inventor Of The Twitter Hashtag Explains Why He Didn't Patent It]
 +
* [https://blog.twitter.com/2013/forward-secrecy-at-twitter-0 Forward secrecy at Twitter]
 +
* [http://mashable.com/guidebook/twitter/ Twitter Guide Book – How To, Tips and Instructions by Mashable]
 +
* [http://bits.blogs.nytimes.com/2014/03/14/spring-cleaning-who-has-access-to-your-data/ Spring Cleaning Who Has Access to Your Data]
 +
* [http://www.neatorama.com/twaggies/ Twaggies: cartoons inspired by tweets]
 +
* [http://readwrite.com/2014/03/17/twitter-identity-problem-what-is-it Twitter Still Has An Identity Problem Eight Years Later]  
 +
* [http://mashable.com/2014/06/19/twitter-gifs/ How to post GIFs on Twitter]
 +
* [http://blog.embed.ly/post/89265229166/what-twitter-isnt-telling-you-about-gifs What Twitter Isn’t Telling You About GIFs]
 +
* [https://www.yahoo.com/tech/bad-news-twitter-now-shows-you-tweets-from-people-you-95292072389.html Twitter Now Showing You Even More Tweets from People You Don’t Follow]
 +
* [http://inkdroid.org/journal/2014/08/31/on-archiving-tweets/ On archiving tweets]
 +
* [https://blog.twitter.com/2014/building-a-complete-tweet-index Building a complete tweet index]
 +
* [http://library.gwu.edu/scholarly-technology-group/posts/harvesting-twitter-streaming-api-warc-files Harvesting the Twitter Streaming API to WARC files]
 +
* [http://inkdroid.org/2016/04/02/follow/ Twitter API apparently sometimes gives tweets from accounts not followed]
 +
 
 +
[[Category:ZIP based file formats]]
 +
[[Category:Twitter| ]]
 +
[[Category:Social networking]]
 +
[[Category:Web]]

Latest revision as of 01:35, 28 February 2020

File Format
Name Twitter
Ontology

Twitter is a popular social-networking and messaging service, accessible through the web and mobile device apps, allowing users to write 140-character messages publicly or privately. Often the messages include hyperlinks, which get sent through URL shorteners (so they might suffer linkrot if the shortening services go away). Some of the conventions of the service are discussed in the article on Hashtags, at-signs, retweets, etc.

Of interest to archivers is the fact that, as of late 2012, Twitter has started rolling out a feature to permit users to save their entire tweet history as an archive file.

The Data Transfer Project is building a project for moving data between services such as this one.

Twitter is also one of the engulf-and-devour Internet megacorporations now which has swallowed up, digested, and excreted other Internet services, a 2013 example being Posterous.

The unrelated system Twister is an open-source, encrypted, decentralized implementation of a similar concept to Twitter.

Contents

[edit] Downloaded Twitter archive

If you have been given the option to download your Twitter history (it has been given gradually to users, so you may or may not have this option now yourself, but probably will in the future if you don't now), it appears as a button at the bottom of the "Settings" page in your account. Pressing it causes the generation of an archive of your tweets to be queued, and when it is finished (minutes? hours? whenever?) you get e-mailed at the registered address associated with the account with a link to retrieve your archive. There, you can download it as a ZIP archive (tweets.zip) containing this file and directory structure:

  • README.txt: an ASCII text file (with long lines that scroll way off to the right if your text viewer doesn't wrap long lines) giving some information about the format
  • index.html: HTML file which, when loaded in a browser, lets you view your tweets. The tweets themselves aren't actually in this file, but it pulls in a bunch of JavaScript from the subdirectories, which in turn load the tweets from data files.
  • css: Subdirectory with Cascading Style Sheets.
    • application.min.css Stylesheet (formatted in hard-to-read manner with no line breaks)
  • data: Subdirectory with data files.
    • csv: Subdirectory with CSV files.
      • YYYY_MM.csv: A series of files named by year and month with the tweets in the form of comma-separated values (CSV). The columns are: "tweet_id", "in_reply_to_status_id", "in_reply_to_user_id", "retweeted_status_id", "retweeted_status_user_id", "timestamp", "source", "text", "expanded_urls". The timestamp is in UTC time, in the format YYY-MM-DD HH:MM:SS +0000.
    • js: Subdirectory with JavaScript (user-specific, encoding details about the tweets).
      • payload_details.js
      • tweet_index.js
      • user_details.js
      • tweets
        • YYYY_MM.js: A series of files named by year and month with the tweets in JSON form, with a one-line header turning each file into a JavaScript variable assignment. (Strip that line if using the JSON data elsewhere.)
  • img: Subdirectory with graphics.
    • bg.png: A PNG graphic used as a background.
    • sprite.png: A PNG graphic with sprites used by the scripts.
  • js: Subdirectory with JavaScript.
    • application.min.js: Script used in displaying tweets (formatted in a hard-to-read manner with no line breaks).
  • lib: Subdirectory with various 'library' files used by the scripts.
    • bootstrap: various JavaScript, CSS, and graphics.
    • hogan: Contains another JavaScript file.
    • jquery: Contains another JavaScript file.
    • twt: Contains some more JavaScript, CSS, and graphics.
    • underscore: Contains another JavaScript file.

[edit] TwitPic

TwitPic, a popular hosting service for pictures used on Twitter until it abruptly shut down in 2014, was not affiliated with Twitter, and in fact its shutdown was because Twitter suddenly objected (after years of its operation) to its use of a name resembling Twitter's. People suddenly had to export their images from that service to save them from deletion.

[edit] Documentation

[edit] Software

[edit] Other links and references

Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox