Fair Use, the AP, and the Internet: Why the AP Isn’t Wrong
by
on June 13, 2008,
The AP isn't wrong.
For some odd reason, there is a viewpoint in Web 2.0 that as long as everyone seems to be doing it, the laws should be changed, or somehow re-interpreted. High on the list of those laws that the horde is clamoring to overturn are those dealing with copyright infringement and libel.
The issues regarding copyright and fair use rear their collective head fairly regularly when it comes to online content, with the music industry as well as the writing industry trading places as the bitchmeme du jour. This time up, it's the writing industry, and the giant in this David and Goliath argument is the Associated Press, which filed seven DMCA take-down notices against Drudge Retort.
The argument being made is that Drudge Retort is a news aggregator of the same design as Digg or Reddit, and that since the full text of the articles wasn't appropriated (only excerpts), it is covered under Fair Use. Bloggers far and wide are decrying the AP as a big bully misinterpreting copyright law, but I believe the misinterpretation isn't on the AP's part.
News aggregated in Drudge Retort is user-submitted, with the headline usually the link back to the original source. The excerpts posted are again submitted by the submitter, and usually do not reference the originating source. In their arguments, most bloggers are turning to interpretation of the Fair Use sections of copyright law, but not the actual sections themselves.
The actual section of the U.S. copyright law on Fair Use starts in § 107. Limitations on exclusive rights: Fair use, which states:
Notwithstanding the provisions of sections 106 and 106A, the fair use of a copyrighted work, including such use by reproduction in copies or phonorecords or by any other means specified by that section, for purposes such as criticism, comment, news reporting, teaching (including multiple copies for classroom use), scholarship, or research, is not an infringement of copyright. In determining whether the use made of a work in any particular case is a fair use the factors to be considered shall include —
(1) the purpose and character of the use, including whether such use is of a commercial nature or is for nonprofit educational purposes;
(2) the nature of the copyrighted work;
(3) the amount and substantiality of the portion used in relation to the copyrighted work as a whole; and
(4) the effect of the use upon the potential market for or value of the copyrighted work.
The fact that a work is unpublished shall not itself bar a finding of fair use if such finding is made upon consideration of all the above factors.
Note how vague that is, but also, what it includes. A news aggregator is dubiously included under the "criticism, comment" text since it does include a user forum dedicated to discussion of the article. However, you'll notice that the fourth factor includes "the effect of the use upon the potential market."
Bloggers know full well that their content will likely be scraped at some point, some for the good, but most for the bad. The majority of that scraping ends up happening after the article isn't as relevant: days or weeks after it is originally posted. It can alter search ranking on the article, but it can also help, which is why splogs are the enemy but All Things Digital's Voices is not. One exists solely to make money by stealing others' content, while the other exists to highlight content from smaller places than the WSJ.
The AP is (and will) argue that by including enough of the story for Drudge Report's users to understand the original article without having to trouble themselves by reading it, they are devaluing the content. That is a valid argument. Submitters can grab and repost that content faster than any splog bot can scrape it, meaning it can appear almost as quickly as the AP content does.
The length of the excerpts isn't the issue, since while we were taught that seven identical words in a row was plagiarism in school, copyright law doesn't give any specific length. The argument that the full articles weren't reproduced, or that "everyone does it" isn't a valid argument.
Everything in the world can't be free. Arguing that the AP doesn't always attribute doesn't mean that two wrongs make a right. Being too lazy to write your own summary of an article rather than posting an excerpt and grabbing enough of the article so that lazy readers don't have to bother reading the full article is not, and should not, be covered under Fair Use.








