<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Columbia Science and Technology Law Review &#187; Legal Technologies</title>
	<atom:link href="http://www.stlr.org/category/legal-technologies/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.stlr.org</link>
	<description></description>
	<lastBuildDate>Fri, 18 Jun 2010 23:53:03 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Semantic Lawyering: How the Semantic Web Will Transform the Practice of Law (Part 5)</title>
		<link>http://www.stlr.org/2010/04/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-5/</link>
		<comments>http://www.stlr.org/2010/04/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-5/#comments</comments>
		<pubDate>Fri, 23 Apr 2010 20:04:39 +0000</pubDate>
		<dc:creator>Brian Harley</dc:creator>
				<category><![CDATA[Legal Technologies]]></category>
		<category><![CDATA[semantic web]]></category>
		<category><![CDATA[smart documents]]></category>

		<guid isPermaLink="false">http://www.stlr.org/?p=931</guid>
		<description><![CDATA[(Links to parts 1,  2, 3, and 4.)
Smart document generation
If giving legal advice is one of the two core skills of legal practitioners, the other is drafting legal documents. No matter what area of the law you practice in, you will need to generate a brief, a lease, a will, a contract, a certificate [...]]]></description>
			<content:encoded><![CDATA[<p><em>(Links to parts <a href="../2010/03/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-1/">1</a>,  <a href="../2010/04/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-2/">2</a>, <a href="../2010/04/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-3/">3</a></em>, and <a href="http://www.stlr.org/2010/04/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-4/">4</a>.<em>)</em></p>
<h1>Smart document generation</h1>
<p>If giving legal advice is one of the two core skills of legal practitioners, the other is drafting legal documents. No matter what area of the law you practice in, you will need to generate a brief, a lease, a will, a contract, a certificate of incorporation—you name it. It is no surprise therefore that ever since PCs were first introduced into law firms, lawyers have been looking for ways of using them to make generating documents faster and easier. Word processors helped, and precedent data banks did too, but the Holy Grail in this field is a system that can generate a complete, airtight first draft of the required legal document at the click of a mouse. The idea of software that can generate standardized legal documents is not new. Software packages that produce documents on the basis of certain specified inputs have been on the market for some time. They range from simple electronic forms or automated cut-and-paste to sophisticated software that can draw on internal definitions and even do a measure of logic checking.<a href="#_ftn1">[1]</a> Most law firms nowadays have in place systems of varying degrees of sophistication to avoid re-inventing the wheel each time a legal document is needed.</p>
<p>The Semantic Web promises to take the evolution of document generation further—much further. Advanced functionality such as checking the internal consistency of a document, or checking for compliance with a specified body of rules can be achieved by a non-semantic application built for that purpose. But where semantic applications will really break ahead of the pack is in their ability to draw on a web of structured online legal data and in their interoperability. Being able to access pre-existing taxonomies and rules will facilitate the task of developers, as much of the “logic” an application needs to process will already have been formalized and tested by a broad, collaborative community.</p>
<p>Furthermore, because the task of developing those taxonomies and applying them to data is an ongoing process, less effort will be needed by individual developers to keep applications up-to-date. Suppose a semantic application checks for consistency of the document with a certain body of rules. If a relevant statute is amended, or a court decision clarifies the interpretation of a given rule, there is no need for developers to update the code of the application to implement the amendments. Whatever authoritative online source of legal rules the application draws on can be updated, and <em>all </em>applications drawing on that source will stay abreast of the latest law, without needing to download an update. Another advantage of using smart data is that generating documents would involve more than just producing a human-readable document. The end product would not be a simple text file. Rather, as we have seen, the document could include metadata encoded in accordance with open, machine-readable standards, referencing online taxonomies and rules that give meaning to the data. This means that any other application, whether proprietary or otherwise, which uses those open standards, will be able to process that metadata, and understand the structure and content of the document. The Semantic Web guarantees interoperability by default, and avoids the problem of “smart” documents that are only smart to users who own a particular proprietary application.</p>
<h1>Executable semantic contracts</h1>
<p>If the content of the contract is machine-readable, parts of it may also be machine-executable: if applications can determine the rights and obligations of the parties to such a “semantic contract,” there is no reason why they could not also process payments, notify the parties when notice of renewal is due, renew the contract on specified conditions, etc. In addition to the efficiencies gained in generating the contracts on the lawyer’s side, semantic documents could yield huge gains on the client side. Rather than manually going through each agreement to determine who owes what to whom, when, and on what conditions, semantic contracts could be fed into software that will do this processing automatically.<a href="#_ftn2">[2]</a> With this technology, therefore, the law firm gets to cut the costs of production (and therefore, eventually, the cost of the service), while the client gets an enhanced product that enables it to cut its costs. Expect demand for semantic contracts and the applications that generate them.</p>
<h1>Plain English vs. metadata</h1>
<p>As we have seen, there are limits to the extent to which the plain-English meaning of legal propositions can be translated into formal rules. However, the considerations relating to these limitations are somewhat different in the case of contracts, because of their nature as private legislation between the parties. Here, rather than translating pre-existing laws, the parties are free to choose to draft their agreements using formalized terms and rules that lend themselves to automated analysis and processing. This raises the question of the relationship between the plain-English meaning of the contract (along with the plain-English laws that govern it) and the possibly divergent machine-readable meaning encoded in the metadata. Conceptually, a contract is an agreement between the parties, and the written contract is simply a memorandum or record of that agreement. The rules of contractual interpretation are concerned with ascertaining what rights and obligations the parties have consented to undertake. If I consent to be bound by a semantic contract, am I consenting to be bound by the plain-English terms only, or would the metadata, and the taxonomies the metadata refers to, also guide the interpretation of the agreement?</p>
<p>To put it another way, if I enter into a semantic contract, and the execution of the machine-executable parts of that contract is not what I expected on the basis of the plain English-wording of the contract, has the contract been breached? Suppose that there is no problem with the application that does the executing, but rather that the divergence is caused by differences between the logical implications of the semantic concepts used in the metadata on the one hand, and the positive laws as understood by lawyers and applied by judges on the other. The conservative answer is that the execution and the metadata that enables it are entirely distinct from the contract itself, and machine-execution is ultimately no different from a human agent performing the contract, properly or improperly. But the contrary viewpoint is that what semantic metadata does is to incorporate meaning by reference to definitions and rules external to the data itself. Is that so different from <a href="http://en.wikipedia.org/wiki/Incorporation_by_reference">incorporation by reference</a> in contract law, for example by referring to terms and conditions on the back of a parking ticket, or including <a href="http://www.iccwbo.org/incoterms/id3045/index.html">Incoterms</a> in international trade contracts? Why should the metadata not influence our interpretation of the contract?</p>
<h1>Meaning vs. meaning</h1>
<p>There are deeper questions at issue here, relating to the fundamental differences between machine-executable computer code and legal norms. The kind of “meaning” encoded using Semantic Web standards is deeply different from the kind of “meaning” you and I express when speaking about the law, or the kind expressed by law-makers in creating the law. I will leave these difficult questions hanging for now, but I will hazard to predict that, as machine-executable contracts gain currency and the idea of automated determination and processing of legal obligations becomes commonplace, those fundamental differences between code and law will begin to blur.</p>
<hr size="1" /><a href="#_ftnref">[1]</a> David Siegel,<em> Pull: The Power of the Semantic Web to Transform Your Business</em>, p. 189.</p>
<p><a href="#_ftnref">[2]</a> <em>See </em>Siegel, p. 190.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.stlr.org/2010/04/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-5/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Semantic Lawyering: How the Semantic Web Will Transform the Practice of Law (Part 4)</title>
		<link>http://www.stlr.org/2010/04/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-4/</link>
		<comments>http://www.stlr.org/2010/04/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-4/#comments</comments>
		<pubDate>Wed, 21 Apr 2010 20:03:48 +0000</pubDate>
		<dc:creator>Brian Harley</dc:creator>
				<category><![CDATA[Legal Technologies]]></category>
		<category><![CDATA[semantic web]]></category>

		<guid isPermaLink="false">http://www.stlr.org/?p=928</guid>
		<description><![CDATA[(Links to parts 1, 2, and 3.)
What can you do with the Semantic Web that you can’t do without it?
The Semantic Web is a powerful way of structuring data and giving it a precise, machine-readable meaning. The most obvious and immediate benefit of semantic technologies is in organizing large quantities of information in a particular [...]]]></description>
			<content:encoded><![CDATA[<p><em>(Links to parts <a href="http://www.stlr.org/2010/03/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-1/">1</a>, <a href="http://www.stlr.org/2010/04/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-2/">2</a>, and <a href="http://www.stlr.org/2010/04/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-3/">3</a></em>.<em>)</em></p>
<h1>What can you do with the Semantic Web that you can’t do without it?</h1>
<p>The Semantic Web is a powerful way of structuring data and giving it a precise, machine-readable meaning. The most obvious and immediate benefit of semantic technologies is in organizing large quantities of information in a particular domain to make it easier to retrieve and analyze. This is reflected in the contexts in which these technologies have already been deployed, such as organizing large online databases of content (e.g. bbc.co.uk, see <a href="http://www.bbc.co.uk/blogs/bbcinternet/2010/02/case_study_use_of_semantic_web.html">here</a>); or facilitating the exchange and analysis of research data (e.g. drug research, see <a href="http://www.w3.org/2001/sw/sweo/public/UseCases/Elsevier/">here</a>). Given the problem of legal information expansion discussed in the <a href="../2010/03/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-1/">first post in this series</a>, using semantic taxonomies and rules to organize the vast universe of legal data is clearly a promising area.<a href="#_ftn1">[1]</a></p>
<p>In this post I will go beyond merely identifying the benefits of better structured data. Rather, I want to consider what really distinguishes the Semantic Web from rival technologies by asking: what can you do with the Semantic Web that you can’t do without it? In attempting to answer this question, I will focus on two kinds of application of the Semantic Web which promise to deliver not just enhanced performance, but may even transform the nature of the legal service involved: semantic legal query systems and, in the next part, smart legal documents.</p>
<h1>Lawyers as optimum retrieval intermediaries</h1>
<p>One of the core tasks performed by lawyers is giving legal advice. Schematically, what lawyers do in carrying out this task is to:</p>
<ol>
<li>identify rules in a vast corpus of laws that are relevant to a given legal query;</li>
<li>interpret their legal meaning, often by considering how different rules interact and how they have been interpreted in the past; and</li>
<li>consider how those rules apply to the specific query.</li>
</ol>
<p>What distinguishes lawyers from the man on the street and what justifies both their holding a license to practice and their charging sizable fees for their services, is their (theoretically) superior ability to carry out each of these tasks. To quote the oft-repeated wisdom, the difference between a lawyer and a layman is not that the lawyer knows the law, but that he knows where to find it. I might add that the lawyer also knows whether there are legal rules for a given problem; how different rules interact (which rules preempt or modify other rules); how to check if a law is still in force or a precedent still good law; how to find an authoritative scholarly interpretation; and perhaps most importantly, the lawyer will have a wide experience through of different factual situations and contexts. In this sense, in the delivery of legal advice, a lawyer acts as an intermediary who ensures optimal retrieval of legal knowledge on behalf of his client.<a href="#_ftn2">[2]</a></p>
<h1>Semantic legal queries</h1>
<p>We have seen how lawyers use search engines and commercial databases to deal with step 1 (identify) much more efficiently than was possible in the days of hard-copy statutes and law reports. However, even though researchers started working on expert legal systems as far back the 1970s (see <a href="http://blog.law.cornell.edu/voxpop/2010/02/15/semantic-enhancement-of-legal-information%E2%80%A6-are-we-up-for-the-challenge/">here</a>), in practice, steps 2 (interpretation) and 3 (applying the law to the query) are still largely carried out by the lawyer. This process is aided by technology only to the extent that the identification step 1 is repeated in sourcing secondary materials to guide interpretation and application of the rules. The smarter data generated on the Semantic Web will enable applications to dig deeper into steps 2 and 3.</p>
<p>Leveraging the higher degree of organization of legal data and the possibility of drawing inferences from the data, a semantic legal query system should be able to do more than merely retrieve information based on keywords selected by a human agent. In a world of perfect formalization, an application could carry out the interpretation and the application steps autonomously. But even in the absence of perfection, it is not unrealistic to suggest that within a few years, if enough smart legal data is available on the web, semantic legal query systems will be able to retrieve not just keyword-relevant documents but all or most of the information necessary to carry out steps 2 and 3. The application will know where to find the law (online); it will analyze the structure of the query and scour available data to determine whether there are applicable rules; it will determine what those rules are and suggest how they interact (perhaps retrieving the rules that govern the interaction); it will check whether the rules are up-to-date and retrieve any amendments or qualifications; and it will search for similar fact patterns, precedents and FAQ entries to clarify the application of the rules.</p>
<p>There are at least two major reasons semantic solutions have more potential than rival technologies to achieve these kinds of results. The first relates to the formal structure of Semantic Web standards: because the use of semantic metadata ensures that items of data have a precise meaning, semantic applications can make reliable inferences on the basis of the data. You need certainty to make inferences, because each step amplifies the uncertainty. Take this syllogism: <em>Oracle is a Delaware Corporation; all Delaware Corporations are legal persons; therefore Oracle is a legal person. </em>Now imagine each proposition in the syllogism is the result of a “best guess” data analysis process (e.g. through statistical analysis): <em>There is a 90% percent chance that Oracle is a Delaware Corporation; there is a 90% chance that all Delaware Corporations are legal persons; therefore there is a 81% (90% of 90%) chance that Oracle is a legal person.</em> This uncertainty compounds with each step, so beyond a few steps, any non-marginal uncertainty is fatal.</p>
<p>With the Semantic Web, if your query specifies a defined entity, the application will know <em>precisely</em> what you are referring to. In principle all instances of that object on the Semantic Web will refer to the same (online) definition, which specifies its properties and its relation to other entities. The second reason for the superiority of semantic applications relates to the openness of Semantic Web standards: the widespread adoption of standards for tagging and organizing legal data will ensure that more structured legal information is available than could possibly be achieved by a single provider of proprietary systems.</p>
<h1>DIY and FAQs</h1>
<p>An application that can deliver a page full of the kind of information described above will go a long way in assisting lawyers in carrying out steps 2 and 3 of legal advice delivery. In fact, if the application is good enough, it may even make the lawyer’s input redundant. How much additional specialist knowledge do you really need if all of the relevant information is right before you? Many consumers of legal services are happy to resort to “DIY” legal advice rather than incurring the costs of professional legal services. Online FAQs and other legal resources have proven popular as means of sourcing legal information without consulting a lawyer directly (often made available by legal professionals as a kind of <a href="http://en.wikipedia.org/wiki/Loss_leader">loss leader</a> to attract potential clients). Individual resources are inevitably limited in content, but in the aggregate the free World Wide Web (i.e. excluding subscription websites) is a fairly comprehensive source of legal information. The problem for the untrained is in finding relevant information and distinguishing the accurate and up-to-date sources from the incorrect and out-of-date. A semantic legal query application that enables laymen to access comprehensive, up-to-date legal information in response to their queries would satisfy much of the demand for simpler legal advice, reducing the demand for competing professional advice—if priced right. Even though these applications may not rival good lawyers in the quality of the service, not all consumers of legal services are concerned with getting the best quality. Good-enough might well do.</p>
<h1>More than machines</h1>
<p>Of course, many, if not all, lawyers would strongly resist being described as “optimum information retrieval” machines. Most would see their role as going well beyond merely delivering statements of what the law is to their clients. Rather, they are in the business of delivering solutions, offering advice on how to deal with certain situations, how to handle particular disputes, how to structure transactions, etc. Yet it is undeniable that lawyers, especially junior lawyers, spend much of their time searching for relevant information and assimilating it into bespoke legal advice. What the technological possibilities outlined in this post suggest is that simpler legal advice can likely be significantly automated, while for more complex queries, Semantic Web-based applications could considerably enhance fee-earner productivity in producing legal advice.</p>
<p><em>(Coming soon: Part 5 &#8211; Legal Documents.)</em></p>
<hr size="1" /><a href="#_ftnref">[1]</a> As LaVern Pritchard pointed out in a comment to Part 3 of this series, “legal information” need not include only legal texts—see his article on applying taxonomies to the domain of legal practice <a href="http://www.priweb.com/betterlawfirms.htm">here</a>; see also <a href="http://www.springerlink.com/content/l4fwyeatg4nfxwck/fulltext.pdf">this account</a> of NetCase, a semantic system designed to assist lawyers with transnational cross-referrals.</p>
<p><a href="#_ftnref">[2]</a> See discussion of “optimum retrieval” in <a href="../2010/03/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-1/">Part 1</a> of this series.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.stlr.org/2010/04/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-4/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>STLR is on Twitter</title>
		<link>http://www.stlr.org/2010/04/stlr-is-on-twitter/</link>
		<comments>http://www.stlr.org/2010/04/stlr-is-on-twitter/#comments</comments>
		<pubDate>Mon, 12 Apr 2010 16:32:07 +0000</pubDate>
		<dc:creator>STLR</dc:creator>
				<category><![CDATA[Legal Technologies]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://www.stlr.org/?p=919</guid>
		<description><![CDATA[If regular RSS and Google reader aren&#8217;t your preferred methods of consumption, you can receive a tweet each time we post a new story, which will be once or twice per week during the academic year.  Our Twitter name is columbiastlr, and you can find our Twitter page here.
To any aspiring Twitter-ers: signing up for [...]]]></description>
			<content:encoded><![CDATA[<p>If regular RSS and Google reader aren&#8217;t your preferred methods of consumption, you can receive a tweet each time we post a new story, which will be once or twice per week during the academic year.  Our Twitter name is <em>columbiastlr</em>, and you can find our Twitter page <a href="http://twitter.com/columbiastlr">here</a>.</p>
<p>To any aspiring Twitter-ers: signing up for Twitter is free and pretty easy.  You can read more about it on <a href="http://en.wikipedia.org/wiki/Twitter">Wikipedia</a> or on Twitter&#8217;s <a href="http://twitter.com/about">About</a> page.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.stlr.org/2010/04/stlr-is-on-twitter/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Semantic Lawyering: How the Semantic Web Will Transform the Practice of Law (Part 3)</title>
		<link>http://www.stlr.org/2010/04/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-3/</link>
		<comments>http://www.stlr.org/2010/04/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-3/#comments</comments>
		<pubDate>Thu, 08 Apr 2010 03:37:51 +0000</pubDate>
		<dc:creator>Brian Harley</dc:creator>
				<category><![CDATA[Legal Technologies]]></category>
		<category><![CDATA[semantic web]]></category>

		<guid isPermaLink="false">http://www.stlr.org/?p=903</guid>
		<description><![CDATA[(Check out Part 1 and Part 2, if you missed them.)
A machine-readable version of the law?
David Siegel, an entrepreneur and early blogger, recently published a book entitled Pull, The Power of the Semantic Web to Transform Your Business, the first “business” book about the Semantic Web. Siegel devotes one chapter to exploring the possible impact [...]]]></description>
			<content:encoded><![CDATA[<p><em>(Check out <a href="http://www.stlr.org/2010/03/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-1/">Part 1</a> and <a href="http://www.stlr.org/2010/04/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-2/">Part 2</a>, if you missed them.)</em></p>
<h1>A machine-readable version of the law?</h1>
<p><a href="http://www.dsiegel.com/">David Siegel</a>, an entrepreneur and early blogger, recently published a book entitled <a href="http://www.amazon.com/gp/product/1591842778?ie=UTF8&amp;tag=thpoofpu09-20&amp;linkCode=as2&amp;camp=1789&amp;creative=9325&amp;creativeASIN=1591842778">Pull, The Power of the Semantic Web to Transform Your Business</a>, the first “business” book about the Semantic Web. Siegel devotes one chapter to exploring the possible impact of the Semantic Web on the law and lawyers. An enthusiastic backer of the new technology, Siegel sees huge potential for the Semantic Web to transform the work of lawyers. He believes that work on legal taxonomies and formalized rules may result in “a set of semantic rules that can then serve as the machine-readable version of the law.”<a href="#_ftn1">[1]</a> This is the kind of structured legal data that would make the intelligent legal queries outlined above possible. It raises the question of the future utility of lawyers in a world where much of what they now do can be performed by computer applications. Why go to a lawyer if you can get an authoritative, complete and up-to-date statement of the law online? If the law can be fully specified as a formalized set of machine-readable rules, would we even need lawyers and judges, or could they be replaced with computers and Semantic engineers of the law?</p>
<h1>A note of caution</h1>
<p>I ought to sound a note of caution at this point. The idea of reformulating all of the rules of law as a formal system, with precise classifications of entities and rules governing their interactions, has been tried before. Most students of the law will, at some point in their studies, come across discussions of the German Civil Code (the <a href="http://en.wikipedia.org/wiki/B%9Frgerliches_Gesetzbuch">BGB</a>), which was drafted over a century ago with precisely that aim in mind. It failed. The law has proven too malleable, too changeable, and too subjective a system to codify with mathematical rigor. There is little reason to believe that the Semantic Web will succeed where others have failed, at least in the foreseeable future. Few fields of human activity are as centrally focused on interpretation of often conflicting texts, and as acutely concerned with the ambiguity of human language, as the law. Though the law may be a body of rules, those rules are not of the clear-cut variety that easily lend themselves to formalization.</p>
<h1>How smart does “smart” need to be?</h1>
<p>That does not mean, however, that the taxonomies and rules of the Semantic Web are useless when it comes to the law. Difficult exercises of interpretation may be required in deciding “hard cases” and creative thinking may be needed in handling more complex, high-level legal issues, but much of the daily practice of the law is far less complex or ambiguous. Is a high level of legal expertise really required in producing a first draft of simple terms and conditions or a memo setting out routine advice? The parameters of these kinds of tasks should be relatively easy to formalize. And even if the semantic formalization of the law were less than perfect, a system that understands<em> </em>the structure of legal queries and can achieve near-optimum retrieval could vastly increase the efficiency of legal researchers. <a href="#_msocom_1">[Unknown A1]</a></p>
<p>Again, taxonomies and rules need not be all-encompassing to be useful. The Semantic Web is not the latest incarnation of pie-in-the-sky artificial intelligence. At the heart of the SemanticWeb is the task of developing dictionaries of concepts and rules to make data smarter, and that is a task that can be done piecemeal. Making data smarter does not have to mean encoding all of the subtleties of human language into the data. If an area of legal practice is concerned with a reasonably small set of clearly defined rules, much of the relevant law may be susceptible to being translated into machine-readable standards. Consider an area of regulatory compliance such as food labeling, which involves rules prescribing particular information formats and content, lists of words that must, can, or cannot be used under certain conditions, and similarly well-defined rules. Translating most of these into a “machine readable version of the law” that could serve as the basis for automated compliance-checking systems hardly seems unrealistic. What about other, less straightforward areas of the law? Even where the area evades complete formalization, as will often be the case, semantic applications may significantly enhance the productivity of fee-earners by dealing with routine, low-skill work while leaving the subtler points of law to the flesh-and-blood professional. So, what kinds of application might achieve these efficiency gains?</p>
<p><em>(Coming soon: Part 4 – Smart documents and semantic contracts)</em></p>
<hr size="1" /><a href="#_ftnref">[1]</a> David Siegel, <em>The Power of the Semantic Web to Transform Your Business</em>, p. 187.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.stlr.org/2010/04/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-3/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Semantic Lawyering: How the Semantic Web Will Transform the Practice of Law (Part 2)</title>
		<link>http://www.stlr.org/2010/04/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-2/</link>
		<comments>http://www.stlr.org/2010/04/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-2/#comments</comments>
		<pubDate>Fri, 02 Apr 2010 13:15:46 +0000</pubDate>
		<dc:creator>Brian Harley</dc:creator>
				<category><![CDATA[Legal Technologies]]></category>
		<category><![CDATA[semantic web]]></category>

		<guid isPermaLink="false">http://www.stlr.org/?p=898</guid>
		<description><![CDATA[(If you missed part 1 of the series, check it out here.)
What is the Semantic Web?
The Semantic Web is a way of making data smart. The idea is, rather than building smart applications that can analyze “dumb” data, you make the data smart in the first place. The problem with dumb data is that the [...]]]></description>
			<content:encoded><![CDATA[<p><em>(If you missed part 1 of the series, check it out <a href="http://www.stlr.org/2010/03/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-1/">here</a>.)</em></p>
<h1>What is the Semantic Web?</h1>
<p>The Semantic Web is a way of making data smart. The idea is, rather than building smart applications that can analyze “dumb” data, you make the data smart in the first place. The problem with dumb data is that the ability of applications to make sense of human language is limited. Currently, the information in most web pages and text documents is “human language,” encoded in data formats that tell computers nothing about their <em>meaning</em>. What the standards that make up the core of the Semantic Web do is to provide data formats that can be used to make the meaning of information explicit.</p>
<h1>Dumb data vs. smart data</h1>
<p>So how is this done? What differentiates smart data from dumb data? If you view the source code of this web page (try it – it’s in <em>View</em> &gt; <em>Source</em> in Explorer; <em>View &gt; Page Source</em> in Firefox, <em>View &gt; View Source </em>in Safari), you will see some text and a lot of “tags” between angled brackets, such as “&lt;p&gt;” and “&lt;div id=‘header’&gt;.” This is HTML, the mark-up language in which most information currently on the World Wide Web is encoded. It tells your browser how to display the text and images, and where to redirect when you click on a link – but not much else. Information encoded in plain HTML is dumb data. Let’s consider an example. In HTML, you might have the following text:</p>
<p>&lt;p&gt;Sun is a subsidiary of Oracle.&lt;/p&gt;</p>
<p>The HTML tells your browser that text enclosed between the opening tag “&lt;p&gt;” and the closing tag “&lt;/p&gt;” should be displayed as a single paragraph, and nothing more. A simple search engine might hit on this sentence even if I intended to search for the “sun,” as in the sun in the sky, or an “oracle,” as in the Oracle of Delphi. An application with advanced language-processing abilities might be able to deduce from the absence of an article (“a” or “the”) that “Sun” and “Oracle” are names. It might also deduce from the mention of “subsidiary” that the sentence in fact refers to names of corporations. In the current state of technology, this is likely to be a hit-and-miss process.</p>
<h1>Making data smart</h1>
<p>The idea behind the Semantic Web is to attach machine-readable metadata (data about data) to information that can be interpreted by any Semantic Web application. To better understand what this involves, imagine a mark-up language that enables you to specify what the things being referred to <em>are. </em>Imagine that this mark-up language enabled you to add tags to your data to specify things like:</p>
<p>&lt;item <strong><em>this is a corporation</em></strong>&gt; Sun &lt;/item&gt;</p>
<p>&lt;item <strong><em>this is a legal relationship between two corporations</em>&gt; </strong>is a subsidiary of &lt;/item&gt;</p>
<p>&lt;item <strong><em>this is a corporation</em></strong>&gt; Oracle &lt;/item&gt;</p>
<p>Even better, imagine that, rather than just labeling things, you could refer to a source of information on the web that tells you more about each of these things, e.g.:</p>
<p>&lt;item <strong><em>see</em></strong><em> </em>http://www.dbpedia.org/resource/Oracle_Corporation&gt; <strong>Oracle </strong><em>&lt;/</em>item&gt;<strong> </strong></p>
<p>The link referred to is a “resource” – a bundle of data available online that describes something. This resource contains data, encoded in a machine-readable format, which might state that Oracle is a Delaware corporation, that it is headquartered in Redwood City, California, that the current CEO is Larry Ellison, etc.</p>
<p>Now let’s take this one step further, and imagine that, when that “Oracle” resource states that Oracle is a “Delaware corporation,” it in turn refers to an online resource that defines the term “Delaware corporation.” That definition might specify that a Delaware corporation is a kind of legal person, that it should have a certificate of incorporation, bylaws, a board of directors, etc. Of course, these statements would also be machine-readable, and could in turn refer to other resources (defining “legal person,” “certificate of incorporation,” “board of directors,” etc.).</p>
<h1>Classifications and rules</h1>
<p>Where does it all end? It ends with “thing.” That is, a “corporation” is a “legal person,” which is a kind of “person,” which is a kind of “thing.” A “certificate of incorporation” is a “legal document,” which is a kind of “document,” which is a kind of “thing.” Everything is a thing, and so every “resource” is a kind of thing, which fits into a classification of things (a taxonomy). One of the most important aspects of the Semantic Webs is defining taxonomies of different kinds of things using machine-readable formats. There is no need for a single, all-encompassing taxonomy which defines every possible thing: partial taxonomies can define a few terms by referring to other taxonomies, and all of these interlinked taxonomies ultimately refer to the most general standards (remember, this can be done because they are all online).</p>
<p>The Semantic Web also goes beyond mere classifications, allowing you to specify rules for each kind of thing. For example, you could specify that a “director” of a “Delaware corporation” can be a natural person, but cannot be a legal person. You could specify that the property (predicate) of “having a subsidiary” must have a corporation as its subject and another, different corporation as its object.</p>
<p>The foregoing does not purport to be a technical exposition of the Semantic Web, but I hope you get the idea. The core of the Semantic Web is a set of precisely defined standards that can be used to make data smarter by making explicit the underlying structure of the information.<a href="#_ftn1">[1]</a> Online classifications and rules enable applications to identify and analyze the data in much greater depth and with much greater precision than existing alternative technologies.</p>
<h1>The state of the technology</h1>
<p>Not all of the pieces of the system outlined above are in place. The basic standards of the Semantic Web, including the Resource Description Framework (<a href="http://en.wikipedia.org/wiki/Resource_Description_Framework">RDF</a>) and the Web Ontology Language (<a href="http://en.wikipedia.org/wiki/Web_Ontology_Language">OWL</a>), are by now reasonably mature and stable standards. However, there is still a good deal of work to be done and problems to be ironed out before the vision of the Semantic Web is fully made a reality (see <a href="http://en.wikipedia.org/wiki/Semantic_Web#Challenges">here</a> and <a href="http://www.oreillynet.com/xml/blog/2006/06/the_7_flaws_of_the_semantic_we.html">here</a>). Nevertheless, an increasing number of big names have been adopting Semantic Web standards to structure their data (<a href="http://open.blogs.nytimes.com/tag/semantic-web/">New York Times</a>, <a href="http://www.semanticweb.com/news/follow_the_money_with_redesigned_recoverygov_139495.asp">recovery.gov</a>, <a href="http://www.slideshare.net/fantasticlife/semweb-at-the-bbc?src=embed">BBC</a>, <a href="http://www.opencalais.com/">Thomson Reuters</a>). Identifying the real-world future implications of the Semantic Web is no longer science fiction, even for the legal industry.</p>
<p><em>(Next up: <a href="http://www.stlr.org/2010/04/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-3/">Part 3 &#8211; A Machine Readable Version of the Law?</a>)</em></p>
<hr size="1" /><a href="#_ftnref">[1]</a> Siegel, <a href="http://www.amazon.com/gp/product/1591842778?ie=UTF8&amp;tag=thpoofpu09-20&amp;linkCode=as2&amp;camp=1789&amp;creative=9325&amp;creativeASIN=1591842778">Pull, The Power of the Semantic Web to Transform Your Business</a>, p.13.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.stlr.org/2010/04/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-2/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Semantic Lawyering: How the Semantic Web Will Transform the Practice of Law (Part 1)</title>
		<link>http://www.stlr.org/2010/03/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-1/</link>
		<comments>http://www.stlr.org/2010/03/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-1/#comments</comments>
		<pubDate>Wed, 31 Mar 2010 13:03:14 +0000</pubDate>
		<dc:creator>Brian Harley</dc:creator>
				<category><![CDATA[Legal Technologies]]></category>
		<category><![CDATA[semantic web]]></category>

		<guid isPermaLink="false">http://www.stlr.org/?p=891</guid>
		<description><![CDATA[“Predicting the future is a hazardous business.” So cautions Richard Susskind in his recent exercise in legal futurology, The End of Lawyers? Rethinking the Nature of Legal Services, citing a number of amusingly inaccurate predictions made over the years about the future of IT. In a series of posts, I venture into that hazardous business [...]]]></description>
			<content:encoded><![CDATA[<p>“Predicting the future is a hazardous business.” So cautions <a href="http://www.susskind.com/">Richard Susskind</a> in his recent exercise in legal futurology, <a href="http://www.amazon.com/End-Lawyers-Rethinking-Nature-Services/dp/0199541728">The End of Lawyers? Rethinking the Nature of Legal Services</a>, citing a number of amusingly inaccurate predictions made over the years about the future of IT. In a series of posts, I venture into that hazardous business by taking a look at the <a href="http://en.wikipedia.org/wiki/Semantic_Web">Semantic Web</a>, an exciting current development in IT, and considering how it might impact the law and lawyers. The Semantic Web is an emerging technology which promises to vastly increase the ability of computers to analyze information, resulting in smarter applications, more efficient search engines, and many more improvements to our current ability to retrieve and process data. Applied to the law, the Semantic Web may have a transformative effect on the way lawyers carry out their business. In this post, I explain why.</p>
<h1>The problem: too much data</h1>
<p>There are currently over 25 billion web pages on the World Wide Web. In fact, that figure covers only the indexable web, so those 25 billion pages may be only the tip of the iceberg (see <a href="http://quod.lib.umich.edu/cgi/t/text/text-idx?c=jep;view=text;rgn=main;idno=3336451.0007.104">this paper</a> on the “deep web”). Looking beyond the web to total production of information, a study by International Data Corp carried out in 2008 predicts that 1,200 exabytes of data will be generated in 2010 (cited by The Economist <a href="http://www.economist.com/specialreports/displaystory.cfm?story_id=15557421">here</a>). To put this in perspective, note that one <a href="http://en.wikipedia.org/wiki/Byte">byte</a> of information is a sequence of eight bits – a sequence of eight digits which can be either one or zero. One <a href="http://en.wikipedia.org/wiki/Exabyte">exabyte</a> is 1,000,000,000,000,000,000 bytes (10<sup>18</sup>), or one billion gigabytes. The text of this blog post, in plain text format, takes up about 13,000 bytes. The challenge of identifying and retrieving relevant data in this ever-expanding universe of information is growing in step with the volume of the information itself. Achieving what Richard Susskind calls “information satisfaction” – getting the information you want, and only the information you want – in the face of this exponential expansion is an increasingly daunting task. This is even more true of the challenge of achieving “optimum retrieval” – for a given query, being confident that the single best document has been returned. Google’s “<a href="http://www.google.com/support/websearch/bin/answer.py?hl=en&amp;answer=30735">I’m feeling lucky</a>” option may sometimes be surprisingly accurate, but not with any reliable degree of certainty.</p>
<h1>Too much legal data</h1>
<p>The problem of too much data will be familiar to law students, associates, and anyone else who has carried out legal research. The volume of legislation, case law, commentary on the law, and the like is no exception to the current phenomenon of information expansion. “Googling it” can provide a good first stab at some legal problems, but no lawyer who fears malpractice suits would rely exclusively on results from a general search engine. Commercial legal databases provide more structured and authoritative databanks of legal information, but they are expensive, difficult to use for the untrained, and the search is still conducted mostly by means of citations and keywords. Whether legal sources are identified by a search engine or using a commercial database, the actual task of analyzing and interpreting the texts is conducted by the lawyer – not the machine.</p>
<p>If I want to ascertain, say, what information I must provide in the certificate of incorporation of a Delaware Corporation, I can search “Delaware corporation law,” click through the link that looks most relevant, scan the text (perhaps with the help of the “find” function), identify the relevant section, and read through it to draw up a list of the requirements. If I am especially diligent, I might also check case law in a commercial database to see if judicial decisions have added to or qualified these requirements. Now imagine that, instead of proceeding by keyword searches and “manual” analysis, I could simply enter the query “What information must be provided in the certificate of incorporation of a Delaware Corporation?” and the search engine returned a <em>complete, authoritative</em> list of all of the requirements, along with any qualifications or additions made by the case law. That, in a nutshell, is the promise of the Semantic Web.</p>
<p><em>(Next up: <a href="http://www.stlr.org/2010/04/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-2/">Part 2 &#8211; What is the Semantic Web?</a></em>)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.stlr.org/2010/03/semantic-lawyering-how-the-semantic-web-will-transform-the-practice-of-law-part-1/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>STLR Link Roundup &#8211; January 15, 2010</title>
		<link>http://www.stlr.org/2010/01/stlr-link-roundup-january-15-2010/</link>
		<comments>http://www.stlr.org/2010/01/stlr-link-roundup-january-15-2010/#comments</comments>
		<pubDate>Sat, 16 Jan 2010 20:59:22 +0000</pubDate>
		<dc:creator>STLR</dc:creator>
				<category><![CDATA[Copyright]]></category>
		<category><![CDATA[Data Breach Law]]></category>
		<category><![CDATA[International Law]]></category>
		<category><![CDATA[Legal Technologies]]></category>
		<category><![CDATA[Patents]]></category>
		<category><![CDATA[Privacy]]></category>

		<guid isPermaLink="false">http://www.stlr.org/?p=775</guid>
		<description><![CDATA[Here&#8217;s the latest on the STLR radar:

Twitter is a source of evidence for a murder charge, reports the New York Daily News.  But could those tweets be copyrighted?  Law.com&#8217;s Law Technology News weighs in.


The Electronic Frontier Foundation provides a good, link-heavy analysis of the unanswered questions surrounding Google&#8217;s decision to stop censoring their Chinese services.


For [...]]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s the latest on the STLR radar:</p>
<ul>
<li>Twitter is a source of evidence for a murder charge, reports the <a id="e-0y" title="New York Daily News" href="http://www.nydailynews.com/news/ny_crime/2010/01/10/2010-01-10_twitter_becomes_key_evidence_in_case_after_jameg_blake_charged_with_murdering_fr.html">New York Daily News</a>.  But could those tweets be copyrighted?  Law.com&#8217;s Law Technology News <a id="f8tc" title="weighs in" href="http://www.law.com/jsp/lawtechnologynews/PubArticleLTN.jsp?id=1202438916120&amp;rss=ltn">weighs in</a>.</li>
</ul>
<ul>
<li>The Electronic Frontier Foundation provides a good, link-heavy analysis of the <a id="m4v1" title="unanswered questions" href="http://www.eff.org/deeplinks/2010/01/google-china-unanswered-questions">unanswered questions</a> surrounding Google&#8217;s decision to stop censoring their Chinese services.</li>
</ul>
<ul>
<li>For some reason, Psystar keeps fighting Apple, posts <a id="gyy9" title="Gizmodo" href="http://gizmodo.com/5449400/there-is-no-quit-in-psystar-but-there-should-be">Gizmodo</a>.</li>
</ul>
<ul>
<li>Custom and Border Protection&#8217;s laptop searches may have gone too far, as revealed in a series of Freedom of Information Act requests for documents, reports <a id="xgvh" title="Gizmodo" href="http://gizmodo.com/5449455/official-laptop-search-documents-reveal-sloppy-data-handling?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed%3A+gizmodo%2Ffull+%28Gizmodo%29&amp;utm_content=Google+Reader">Gizmodo</a>.  EFF is <a id="y77f" title="looking" href="http://www.eff.org/deeplinks/2010/01/have-you-been-subjected-suspicionless-laptop-searc">looking</a> for potential plaintiffs.</li>
</ul>
<ul>
<li>The Wall Street Journal Law Blog <a id="o-yc" title="analyzes" href="http://blogs.wsj.com/law/2010/01/15/medical-technology-and-the-law-on-the-rights-of-surrogate-mothers/?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed%3A+wsj%2Flaw%2Ffeed+%28WSJ.com%3A+Law+Blog%29&amp;utm_content=Google+Reader">analyzes</a> how advances in medical science have impacted the rights of surrogate mothers.</li>
</ul>
<ul>
<li>IBM gets the more U.S. patents than any other company for the 17th consecutive year, says MSN&#8217;s <a id="fyel" title="Moneycentral" href="http://news.moneycentral.msn.com/printarticle.aspx?feed=PR&amp;date=20100112&amp;id=10979684">Moneycentral</a>.  But Microsoft&#8217;s much smaller patent portfolio is worth more, reports <a id="yqiq" title="The 271 Patent Blog" href="http://271patent.blogspot.com/2010/01/patent-portfolios-more-value-less.html">The 271 Patent Blog</a>.</li>
</ul>
<ul>
<li>The Prior Art offers <a id="dbo4" title="a fascinating look" href="http://thepriorart.typepad.com/the_prior_art/2010/01/jurors-from-i4i-v-microsoft.html">a fascinating look</a> into the jury&#8217;s decision in i4i v Microsoft, complete with juror interviews.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.stlr.org/2010/01/stlr-link-roundup-january-15-2010/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>STLR Link Roundup &#8211; January 8, 2010</title>
		<link>http://www.stlr.org/2010/01/stlr-link-roundup-january-8-2009/</link>
		<comments>http://www.stlr.org/2010/01/stlr-link-roundup-january-8-2009/#comments</comments>
		<pubDate>Fri, 08 Jan 2010 23:58:39 +0000</pubDate>
		<dc:creator>STLR</dc:creator>
				<category><![CDATA[Copyright]]></category>
		<category><![CDATA[File Sharing]]></category>
		<category><![CDATA[International Law]]></category>
		<category><![CDATA[Legal Technologies]]></category>
		<category><![CDATA[Link Roundup]]></category>
		<category><![CDATA[Patents]]></category>
		<category><![CDATA[Trademark]]></category>

		<guid isPermaLink="false">http://www.stlr.org/?p=771</guid>
		<description><![CDATA[Here&#8217;s the latest on the STLR radar:

Chief U.S. District Judge Vaughn Walker in San Francisco decided to allow showing the trial challenging California&#8217;s Proposition 8 on YouTube, reports the San Francisco Chronicle.  The Wall Street Journal Law Blog questions whether that&#8217;s a good thing.


Patent Librarian notes that Wikipedia citations in patent applications are up 59%, [...]]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s the latest on the STLR radar:</p>
<ul>
<li>Chief U.S. District Judge Vaughn Walker in San Francisco decided to allow showing the trial challenging California&#8217;s Proposition 8 on YouTube, reports the <a id="h1b_" title="San Francisco Chronicle" href="http://www.sfgate.com/cgi-bin/article.cgi?f=/c/a/2010/01/07/BA121BEGI8.DTL">San Francisco Chronicle</a>.  The Wall Street Journal Law Blog <a id="rbg3" title="questions" href="http://blogs.wsj.com/law/2010/01/07/prop-8-trial-to-be-shown-on-youtube-is-that-a-good-thing/">questions</a> whether that&#8217;s a good thing.</li>
</ul>
<ul>
<li><a id="zmhy" title="Patent Librarian" href="http://patentlibrarian.blogspot.com/2010/01/wikipedia-citations-in-patents-up-59.html">Patent Librarian</a> notes that Wikipedia citations in patent applications are up 59%, but <a id="e.6y" title="Patenly-O" href="http://www.patentlyo.com/patent/2010/01/wikipedia-citations-in-patents-up-59-percent.html">Patently-O</a> puts that increase in perspective.</li>
</ul>
<ul>
<li>A report commissioned by the French government recommends taxing Google on their online advertising revenues in France to help fund legal outlets to buy media hurt by online piracy, reports the <a id="john" title="Associated Press" href="http://www.mercurynews.com/business-headlines/ci_14141563">Mercury News</a>.  President Sarkozy supports the measure, says <a id="v68j" title="PC World" href="http://www.pcworld.com/article/186356/president_sarkozy_adds_his_support_to_french_google_tax_plan.html">PC World</a>.</li>
</ul>
<ul>
<li>The Wall Street Journal <a id="e63c" title="reports" href="http://online.wsj.com/article/SB10001424052748703436504574640623301172810.html">reports</a> that Philip K. Dick&#8217;s estate claims Google infringed on its intellectual property by using the name &#8220;Nexus One&#8221; for the new Google-branded phone.  It brings to mind <a id="qann" title="this recent post" href="http://sethgodin.typepad.com/seths_blog/2009/12/how-to-protect-your-ideas-in-the-digital-age.html">this recent post</a> by Seth Godin.</li>
</ul>
<ul>
<li>The Electronic Frontier Foundation <a id="fksm" title="responds" href="http://www.eff.org/deeplinks/2010/01/et-tu-u2">responds</a> to Bono&#8217;s recent New York Times <a id="izgi" title="Op-Ed" href="http://www.nytimes.com/2010/01/03/opinion/03bono.html">Op-Ed</a>, in which the musician / global icon lamented media piracy and suggested digital tracking be used to help criminal enforcement.</li>
</ul>
<ul>
<li><a id="l.gf" title="Law.com" href="http://www.law.com/jsp/lawtechnologynews/PubArticleLTN.jsp?id=1202437419175&amp;rss=ltn&amp;hbxlogin=1">Law.com</a> provides an insightful guide to mining web 2.0 as a source of evidence.</li>
</ul>
<ul>
<li>The Colorado Department of Transportation created an iPhone app to tell users if they&#8217;re too drunk to drive, the latest in a series of state efforts &#8220;to reach out to the Twitter-iPhone-Facebook generation,&#8221; according to the <a id="zwod" title="Wall Street Journal" href="http://online.wsj.com/article/SB126222210370911181.html">Wall Street Journal</a>.</li>
</ul>
<ul>
<li>The Ninth Circuit Court of Appeals affirmed [<a id="wz-2" title="decision, pdf" href="http://www.ca9.uscourts.gov/datastore/opinions/2009/12/28/08-55622.pdf">decision, pdf</a>] a district court ruling that tasers should only be used in limited circumstances, as they pose a greater threat to their targets than other non-lethal police weapons.  The San Jose Mercury News <a id="qh:u" title="reports" href="http://www.mercurynews.com/breaking-news/ci_14090157?nclick_check=1">reports</a> on the suit that originated from a city police officer using a stun gun on a San Jose State student.</li>
</ul>
<ul>
<li>Broadcom agreed to settle the securities fraud class action against it, says the <a id="kuuu" title="Wall Street Journal Law Blog" href="http://blogs.wsj.com/law/2009/12/29/broadcom-agrees-to-pay-160-million-to-settle-securities-suit/">Wall Street Journal Law Blog</a>.</li>
</ul>
<ul>
<li>The L.A. Times <a id="rafx" title="reports" href="http://www.latimes.com/entertainment/news/arts/la-et-science-center29-2009dec29,0,6400745.story">reports</a> that the California Science Center has been sued for canceling a showing of film attacking Darwinian evolution and promoting intelligent design.</li>
</ul>
<ul>
<li>Blizzard helps police make a drug arrest of a suspect tracked by his World of Warcraft account, posts <a id="uxea" title="kokomo perspective" href="http://kokomoperspective.com/news/local_news/article_15a0a546-f574-11de-ab22-001cc4c03286.html">kokomo perspective</a>.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.stlr.org/2010/01/stlr-link-roundup-january-8-2009/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>RECAP Attempts to &#8220;Turn PACER Around&#8221;</title>
		<link>http://www.stlr.org/2009/12/recap-attempts-to-turn-pacer-around/</link>
		<comments>http://www.stlr.org/2009/12/recap-attempts-to-turn-pacer-around/#comments</comments>
		<pubDate>Sat, 05 Dec 2009 08:09:34 +0000</pubDate>
		<dc:creator>Rajiv Batra</dc:creator>
				<category><![CDATA[Legal Technologies]]></category>
		<category><![CDATA[access to law]]></category>
		<category><![CDATA[litigation]]></category>
		<category><![CDATA[PACER]]></category>
		<category><![CDATA[RECAP]]></category>

		<guid isPermaLink="false">http://www.stlr.org/?p=717</guid>
		<description><![CDATA[
The American legal profession is not generally known for adopting new technology, setting up open access to laws and legal procedures, or offering things for free.  Internet culture is the opposite: fervently experimental, open, and free/shared whenever possible.  Private intersections of the two have fallen on a continuum, from closed and expensive like Lexis/Westlaw, to [...]]]></description>
			<content:encoded><![CDATA[<p><script type="text/javascript"></script></p>
<p>The American legal profession is not generally known for adopting new technology, setting up open access to laws and legal procedures, or offering things for free.  Internet culture is the opposite: fervently experimental, open, and free/shared whenever possible.  Private intersections of the two have fallen on a continuum, from closed and expensive like Lexis/Westlaw, to open and free like the new <a href="http://scholar.google.com/">Google Scholar</a> (see our analysis of Google Scholar <a href="../2009/11/google-scholar-free-case-law-for-everyone/">here</a>).  One of the latest innovations in open access to previously closed legal information is RECAP, a project of Princeton&#8217;s Center for Information Technology Policy.  Created by computer scientists with a policy tilt—not J.D.s—RECAP takes aim at PACER, the online service for accessing federal court records.  RECAP threatens to unravel PACER&#8217;s paid-access system <a href="https://www.recapthelaw.org/why-it-matters/">in the name of</a> &#8220;hastening the day when court records would be freely available to the general public via the Internet.&#8221;</p>
<h1>PACER</h1>
<p><a href="http://pacer.psc.uscourts.gov/pacerdesc.html">PACER</a>, which stands for Public Access to Court Electronic Records, &#8220;is an electronic public access service that allows users to obtain case and docket information from Federal Appellate, District and Bankruptcy courts, and the U.S. Party/Case Index via the Internet.&#8221;   Started as a pilot program in 1989 (PACER chronology <a href="http://pacer.psc.uscourts.gov/documents/epachron.pdf">pdf)</a>, PACER allows registered parties to access public court documents from most federal courts, and is popular—even essential—for many attorneys litigating in federal court.  It is not a single database – each participating federal court operates its own document database, and PACER provides centralized registration, billing, and technical support.</p>
<p>PACER is funded by fees collected from users.  Users pay eight cents per page for records accessed through PACER—primarily pdfs of court papers, and also search results and document lists.  PACER&#8217;s fee structure is mandated by Congress, originally in 1988 and reapproved most recently in the E-Government Act of 2002.  There are exceptions to the eight cent charge; for example, &#8220;written opinions&#8221; by judges &#8220;that set … forth a reasoned explanation for a court&#8217;s decision&#8221; are available for free, and the maximum charge for any single document is capped at 30 pages ($2.10).  Additionally, courts may <a href="http://pacer.psc.uscourts.gov/faq.html#GP18">exempt</a> users from paying fees &#8220;in order to avoid unreasonable burdens and to promote public access to such information.&#8221;  However, &#8220;exemption of PACER fees will only apply in the jurisdiction that issued the order,&#8221; so to receive a fee exemption applying to all documents accessible through PACER would necessitate petitioning each of the federal courts separately.</p>
<p>Though ahead of the internet-curve at the time of its inception, PACER has come under fire for its &#8220;<a href="http://www.nytimes.com/2009/02/13/us/13records.html">cumbersome</a>&#8221; interface and its continued use of fees in an age of increasingly free access to all sorts of information online.   Critics argue that in America&#8217;s common-law system, court documents and decisions form the substance of the law itself, and that PACER&#8217;s fees restrict access to what should be universally available for free.  Moreover, PACER collects more money in fees than are needed to run its servers and maintain the system—<a href="http://www.nytimes.com/2009/02/13/us/13records.html?pagewanted=all">$150 million more, as of 2008</a>—which undermines any claim that eliminating PACER&#8217;s fees would eliminate PACER itself.  There have been <a href="http://managingmiracles.blogspot.com/2009/02/lieberman-letter-on-pacer.html">calls for legislative change</a> to open up access to PACER (among other repositories of government data), but little movement toward actual change – especially since federal funding would have to replace the fees used to keep PACER operational.  Without any such changes, the information in PACER remains closed off—not all of its documents make it onto Westlaw or Lexis, and almost none of them will be caught in Google Scholar&#8217;s new legal net.  There have been <a href="http://public.resource.org">efforts</a> to pay for and share large sweeps of <a href="http://pacer.resource.org">data from PACER</a>, but they have been necessarily incomplete.</p>
<h1>RECAP</h1>
<p>Enter the technological, crowdsourced solution: RECAP.  A project from Princeton&#8217;s Center for Information Technology Policy, RECAP—with the motto &#8220;Turning PACER Around&#8221; explaining the backwards name—opens up access to the centralized documents by targeting users of the system instead of  exclusively pushing for legislative policy changes.  RECAP is a Firefox browser plugin that activates when a user logs into PACER.  It then has two functions: RECAP automatically detects any documents the user accesses and uploads copies to the <a href="http://www.archive.org">Internet Archive</a>, and it alerts the user if any of the documents listed on the current PACER screen are available in the archive.  RECAP also creates clickable links to the free, archived documents that appear on the page along with the PACER links.  Thus RECAP users collectively share the documents they access from PACER and each document need only be paid for once to be free for everyone thereafter.</p>
<h1>Wait&#8230; are they allowed to do that?</h1>
<p>Though RECAP&#8217;s copy-and-upload function might ring copyright infringement alarm bells, the system was designed to open access to legal documents without violating any laws.  With regard to copying and sharing documents written by the court, <a href="http://www.copyright.gov/title17/92chap1.html#105">17 U.S.C. 105</a> precludes material by the federal government from receiving copyright protection, saying &#8220;Copyright protection under this title is not available for any work of the United States Government.&#8221;  RECAP doesn&#8217;t only facilitate sharing government works, however; it also works on the briefs, motions, and exhibits submitted by litigation parties.  That material does not fall under 17 U.S.C. 105&#8217;s exemption, and thus is subject to copyright protection at its creation.  CITP <a href="https://www.recapthelaw.org/about/">suggests</a> that RECAP should still be protected from liability under the doctrine of fair use, which allows uses of copyrighted works without permission for things like commentary, news reporting, and research.  However, this is far from a settled point of law—even the posting of briefs by traditional legal research companies Lexis and Westlaw has been <a href="http://www.volokh.com/posts/1248389303.shtml">called into question</a>.</p>
<p>CITP acknowledges this, and says it would like to &#8220;establish [the] precedent&#8221; that &#8220;citizens . . . have the freedom to share public court documents&#8221; without copyright liability.  If challenged on copyright grounds, CITP&#8217;s fair use defense might not be so robust: though popularly championed among netizen activists, fair use as a doctrine has been weakened over the years, and it provides its strongest defense with regard to private, partial, transformative works.  RECAP provides public, total, and exact copies of the material in question, though its decidedly non-commercial nature would likely weigh in RECAP&#8217;s favor.  CITP&#8217;s better defense might be a claim of implied license—arguing that the act of submitting a brief or exhibit to a court implies permission for it to be copied into the public record and then further copied and distributed by commercial and non-commercial organizations as a means of disseminating the common law.</p>
<p>In addition to considering copyright law when designing RECAP, CITP also made sure use of RECAP would not violate PACER&#8217;s prohibition <a href="https://pacer.psc.uscourts.gov/documents/pacer_policy.pdf">(</a>PACER policy <a href="https://pacer.psc.uscourts.gov/documents/pacer_policy.pdf">pdf</a>) on &#8220;any attempt to collect data from PACER in a manner which avoids billing.&#8221;  To that end, CITP points out that &#8220;RECAP users pay for every document they download from PACER, just like any other user . . . RECAP simply gives users a second option: to easily share documents directly with one another.&#8221;  The Administrative Office of the U.S. Courts has tacitly endorsed this distinction in its <a href="http://pacer.psc.uscourts.gov/announcements/general/exemptnotice.html">Notice for Fee Exempt PACER Customers</a>, in which it notified <em>fee exempt </em>users that they are not allowed to use RECAP because of additional restrictions on transfers of data by that class of users.  Ordinary, paying users are not subject to the limitation, which the Administrative Office has <a href="http://pubcit.typepad.com/clpblog/2009/08/official-word-from-us-courts-feel-free-to-use-recap-with-our-blessing.html">confirmed</a> in response to direct inquiry.  RECAP may threaten PACER&#8217;s long-term fee collection model, but it is not likely to face any significant, unique legal challenges.</p>
<h1>Turning PACER around</h1>
<p>As part of a <a href="../2009/11/google-scholar-free-case-law-for-everyone/">trend</a> toward opening access to American common law, RECAP&#8217;s place at the heart or the periphery of the movement remains to be seen.   Like any crowdsourcing application, RECAP&#8217;s usefulness increases as more people use it.  Yet PACER&#8217;s prime users are large, bill-paying law firms, which tend to be wary about adopting new technology and have little incentive to contribute documents they paid for to a free database.</p>
<p>&#8220;Success&#8221; for RECAP may not be mainstream adoption, however.  Merely by creating the working plugin and calling attention to the problem of restricted access to court documents, CITP has advanced the cause of reforming and opening up access to PACER.  That alone is &#8220;Turning PACER around.&#8221;</p>
<p><em>By Rajiv Batra.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.stlr.org/2009/12/recap-attempts-to-turn-pacer-around/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Google Scholar &#8211; Free Case Law For Everyone!</title>
		<link>http://www.stlr.org/2009/11/google-scholar-free-case-law-for-everyone/</link>
		<comments>http://www.stlr.org/2009/11/google-scholar-free-case-law-for-everyone/#comments</comments>
		<pubDate>Fri, 20 Nov 2009 23:22:32 +0000</pubDate>
		<dc:creator>Jane Wu</dc:creator>
				<category><![CDATA[Legal Technologies]]></category>
		<category><![CDATA[google scholar]]></category>
		<category><![CDATA[legal research]]></category>
		<category><![CDATA[lexis]]></category>
		<category><![CDATA[westlaw]]></category>

		<guid isPermaLink="false">http://www.stlr.org/?p=654</guid>
		<description><![CDATA[Given Google&#8217;s dominance in web searches, it seemed it would only be a matter of time before the company entered the legal arena.  This Tuesday, Google added the ability to freely search legal opinions and journal articles through Google Scholar.  According to Google Scholar&#8217;s documentation, the website provides state appellate and supreme court decisions since [...]]]></description>
			<content:encoded><![CDATA[<p>Given Google&#8217;s dominance in web searches, it seemed it would only be a matter of time before the company entered the legal arena.  This Tuesday, Google added the ability to freely search legal opinions and journal articles through <a id="nlns" title="Google Scholar" href="http://scholar.google.com/">Google Scholar</a>.  According to Google Scholar&#8217;s <a id="akmm" title="documentation" href="http://scholar.google.com/intl/en/scholar/help.html">documentation</a>, the website provides state appellate and supreme court decisions since 1950, U.S. federal district, appellate, tax and bankruptcy court decisions since 1923, and U.S. Supreme Court decisions since 1791.  The number of court opinions is likely to increase with time.</p>
<p>On its official <a id="lxrk" title="Google Blog" href="http://googleblog.blogspot.com/2009/11/finding-laws-that-govern-us.html">Google Blog</a>, Google explains that one of the main objectives in making this service available is to &#8220;empower the average citizen by helping everyone learn more about the laws that govern us all&#8221;.  A quick comparison of Google Scholar&#8217;s home page and that of LexisNexis or Westlaw (the giants of the legal research market) makes it clear that Google <span style="background-color: #ffffff;">has prioritized ease of use by the public</span>.  In typical Google style, no fancy toolbars or esoteric database names stand in the way of finding a specific legal opinion.  There is also a conspicuous absence of distracting advertisements on the page (although this could change in the future).</p>
<h2><strong>What do lawyers think?</strong></h2>
<p>Reactions from the legal community have been mixed so far.  <a id="wbuo" title="Some" href="http://lawonmyphone.blogspot.com/2009/11/google-enters-into-realm-of-legal.html">Law on my phone</a> lauds the fact users don&#8217;t have to jump through hoops in order to print a legal opinion (printing is more complicated on Lexis or Westlaw, though more options are provided).  On the other hand, <a id="xvdk" title="some" href="http://dukelawref.blogspot.com/2009/11/google-scholar-adds-free-legal-content.html">Goodson Blogson</a> points out that Google Scholar doesn&#8217;t yet provide the full functionality of a Westlaw or LexisNexis search.  The lack of a citation search option is the most obvious example of such missing functionality.  Due to Google Scholar&#8217;s zero dollar price point, another major limitation is the unavailability of articles from journals requiring a subscription.</p>
<p>Despite its current limitations, the potential is there for Google to eventually expand Google Scholar to include most of the bells and whistles offered by familiar research companies like Westlaw and LexisNexis.  Google can take advantage of the fact that legal opinions are, for the most part, uniformly formatted to efficiently extract relevant data.  In fact, <a id="hviu" title="Advanced Scholar Search" href="http://scholar.google.com/advanced_scholar_search?hl=en&amp;as_sdt=2000">Advanced Scholar Search</a> already provides search parameters for limiting case law searches to particular jurisdictions and/or date ranges.</p>
<p>Of course, the idea of providing free case law searches is not new.  <a id="a_0c" title="Google" href="http://googleblog.blogspot.com/2009/11/finding-laws-that-govern-us.html">Google acknowledges</a> previous pioneers in the industry (including Tim Wu of <a href="http://www.altlaw.org/">AltLaw</a> and Tim Stanley of <a href="http://www.justia.com/">Justia</a>, among others) who have worked to make legal opinions available for the general public.  Given its awareness of these other research engines, it seems promising that Google will take note of the good design elements of these research engines as it develops Google Scholar into a more comprehensive research tool.<br />
<strong><br />
</strong></p>
<h2><strong>Do Westlaw and LexisNexis stand a chance?</strong></h2>
<p>With all of its potential, will Google Scholar eventually make Westlaw and LexisNexis obsolete? Some commentators, such as <a id="rfl-" style="background-color: #ffffff;" title="Free Law Kerfluffle" href="http://www.slaw.ca/2009/11/03/free-law-kerfuffle/">Bob Berring</a><span style="background-color: #ffffff;">, have expressed d</span>oubts about how trustworthy free legal research systems may be.  Indeed, the mere fact that Westlaw and LexisNexis have dedicated research professionals who are able to provide critical analysis and commentary seem hard to beat in terms of legitimacy.  Unless Google wants to dive fully into the legal research business, it will most likely continue to rely on machines for data mining and language processing.  But no matter how fancy Google&#8217;s language processing algorithms are, it is would be difficult to argue that it has developed to the point of replacing human ability to interpret language, especially when it comes to teasing out the nuances in court opinions.</p>
<p>Westlaw and LexisNexis seem to agree (at least for now) and both claim to be unfazed by the prospect of potentially competing with Google Scholar.  Soon after Google announced the updated search feature, <a id="m.l:" title="both companies" href="http://commonscold.typepad.com/commonscold/2009/11/google-scholar-posts-cases-.html">both companies</a> made statements indicating that while Google Scholar may be a good tool for the general public, their legal customers rely on them for specialized information and legal insight.</p>
<p>Nonetheless, there may be reasons for Westlaw and LexisNexis to worry as new generations of law students will now have the opportunity to shop and compare the traditional legal research systems with Google Scholar.  Many of these law students (and likely many lawyers, especially those just starting) are already familiar with many of Google&#8217;s other services, making Google Scholar a natural way to acquire legal information.  For these individuals, Westlaw and LexisNexis may seem overly complicated and difficult to access by comparison.  Moreover, having had relatively less experience in legal research, Westlaw and LexisNexis may not hold as much authority for these younger lawyers.</p>
<p><span style="background-color: #ffffff;"><span style="background-color: #ffffff;">To be clear, e</span>ach </span>of the research options &#8211; the subscription databases and Google&#8217;s free service &#8211; has advantages which the alternative lacks.  In particular, Westlaw and LexisNexis lack an easy-to-use interface (and lack free access), while Google Scholar lacks advanced search capabilities and professional legal commentary.  The perfect legal research tool is likely somewhere in the middle, though Google Scholar will no doubt be part of the arsenal of a savvy lawyer&#8217;s practice.  Whatever anyone says to the contrary, Google&#8217;s presence in a market causes changes, and the legal research market will likely be no exception.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.stlr.org/2009/11/google-scholar-free-case-law-for-everyone/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
