The Library of Congress attempt to preserve Twitter’s first four years of...
The Library of Congress attempt to preserve Twitter’s first four years of existence has resulted in an archive of 107 billion public tweets, but one that is inaccessible, the LOC announced in a report last week (http://xrl.us/bn9eos). The LOC has…
Sign up for a free preview to unlock the rest of this article
Timely, relevant coverage of court proceedings and agency rulings involving tariffs, classification, valuation, origin and antidumping and countervailing duties. Each day, Trade Law Daily subscribers receive a daily headline email, in-depth PDF edition and access to all relevant documents via our trade law source document library and website.
reached its “first objectives” which were “to acquire and preserve the 2006-10 archive; to establish a secure, sustainable process for receiving and preserving a daily, ongoing stream of tweets through the present day; and to create a structure for organizing the entire archive by date,” according to the report. However, the archived tweets are not accessible, including to researchers and policymakers, due to “technology challenges,” the report continued. “It is clear that technology to allow for scholarship access to large data sets is lagging behind technology for creating and distributing such data,” and “executing a single search of just the fixed 2006-2010 archive on the Library’s systems could take 24 hours,” the report said.