The amusing thing is that most of the posters here don't have a problem with Twitter but with stupid Twitter users; guess what, there are stupid users on message boards, IRC and everywhere so don't bash on a medium because of what the users are doing. This doubly when
you are the one who decides who is worthy of being followed, unlike a message board without an ignore function where you get to read all the posts no matter who made them.
As for the archive, it also seems that you don't get that the context for a period is very important: it will be very useful when looking back at these times to be able to have as much data as possible from as many sources as possible in order to establish that context (for example compare the opinions the press had when Obama was elected with those posted on social media services). This goes hand in hand with the severe lack of context-generating information for the past, where we've just started digging for more data to make sense of it all (from letters sent by a mill owner to receipt stubs).
Also, 140 characters occupy roughly 4KB of space on disk (
really this is just 140 bytes so the following math is severely inaccurate):
4 KB per tweet * 75 million (the current number of Twitter users) = 300 GB
We now need to extrapolate how many accounts are active and tweeting, which is a little harder to do. A study done by Sysomos showed that 5% of users accounted for 75% of all activity (out of which 32% of these were bots but we'll ignore that for now)
That means that the total number of active users is actually around:
5/100 * 75 million = 3 750 000
So let's say an average of about 10 tweets per day for these active users, that means
4KB * 3 750 000 = 15 GB/day
Assuming constant tweeting (which never happens, but let's just assume), we get
4KB * 3 750 000 * 365 = 5.475 TB per year for these 3 750 000 users.
This is a highly inflated number because we will store those tweets as just 140 bytes and not 4KB (this number was me writing a 140 character text and saving it in notepad as UNICODE); if we were to just do that math, we'd end up with:
140 bytes * 3 750 000 * 365 = 191.6 GB per year for the top posters.
With this in mind I seriously have a problem understanding how StealhKnight managed to get to those numbers:
StealthKnight: If every user was to post every day and send 55 million messages then it would equal 772092.7897 TB a day. All the messages per day probably figure around to 7.52974 GB. If this were done every day, congress would need 2748.3551 GB for a years worth of twitters. They would have to dedicate a room for twitter messages that are collected in a lifetime.
I would love to hear the math behind that and the numbers it is based on.
And yes, I am one of the people that find a lot of value in Twitter and defend it whenever I find useless bashing.
The thing I hate the most is that people don't say that they don't get Twitter, it's that they say it's useless, without appending the magical words "for me", as in "Twitter is useless for me".