top of page
Search
Writer's pictureStephen O'Malley

Turbocharging your document review with CAL: uncover facts quickly

The legal technology industry often employs the benefits of technology assisted

review (TAR) and other document analytic methods when addressing the needs of a document discovery effort. The typical application of these innovations range from auto-coding documents for specific case issues to concept clustering and near-duplicate identification. When applied in conjunction with consulting support from trusted advisors and the use of industry best practices, these technologies have helped legal teams effectively reduce the cost of document review.


Predictive analytics or TAR has many flavors and application methods. Across all application methods, the application of TAR involves the use of statistics to show that the documents within a sample set are representative of the balance of documents that are subject to review and that certain characteristics of the sample documents can be used to identify similar documents in the remainder of the review set. Measuring the effectiveness and level of statistic reliability of the TAR model often falls to two descriptive values: Recall, or the percentage of relevant documents identified when applying the TAR model, and Precision, or the percentage of truly relevant documents vs. false positives identified when applying the TAR model. The goal of this technique is to limit the number of documents that need to be reviewed by humans and provide a statistical basis that can be vetted and tested for excluding a portion of the documents from the review.


Often, the development of a traditional TAR model can be a time consuming, repetitive, and complex process at a time when there is significant pressure to review the documents as soon as possible in order to meet court-imposed production deadlines, identify the core documents at issue in a given litigation, and ensure that the teams of reviewers selected to review the documents for the case are occupied as soon as they start charging for their time. Until the TAR model reaches satisfactory levels of precision and recall, additional samples, or training sets, are used in an effort to improve the statistical effectiveness of the model. Typically, review will not begin until the documents identified by the model maintain a recall rate that sufficiently meets the needs of the legal team performing the review.


In contrast, continuous active learning (CAL) is a form of TAR that allows the review to begin ASAP with the benefit of keywords, date restrictions, concept search application, and other methods to focus the review on the core issues at hand without waiting for randomly selected sample training sets to be reviewed and tested against the total population of documents. This can save weeks of sample testing and model training on documents that are largely irrelevant to the issues at hand. CAL works by building a dynamic TAR model that takes into account every decision made by the review team on the documents that the case team believe will hold the most relevant information. The feedback from the ongoing document review is continuously incorporated into the CAL model. Additionally, CAL can be used to continuously prioritize the documents making their way into the review which leads to gradual increase in the number of relevant and important documents a reviewer sees as they progress through the process. This process change is critically important for investigations or matters involving alleged issues that are not fully understood by the legal review team as it improves the speed that they can access critical case documents in a matter, as well as flexibility to adapt the documents being batched for review to new priorities as new information surfaces.


Brute force document review, or linear document review, has been proven to be less accurate and inefficient when compared to TAR models. Nevertheless, there are still instances when TAR is not utilized by a legal team because of the required upfront (and sometimes significant) time commitment involved in the development of a traditional TAR model. The training effort often needs to be repeated, resulting in a time-consuming process during the most critical days of a case. The time lost through this process can be problematic for attorneys to justify, particularly on investigations or complex litigations when time is of the essence during the critical early stages of review.


Identifying and acting on key documents that have a critical impact on the legal strategy of a matter adds additional pressure on the review team to speed up the document review. An important document can impact an array of decisions from how to depose certain witnesses to early settlement decisions. Legal teams will often weigh the time and cost associated with the use of TAR models against starting a review more quickly using less sophisticated document culling strategies, like keyword and concept application, date filtering, etc. CAL changes the calculus associated with these decisions, because it provides the benefits of starting the review immediately with the advantages of a TAR model that will limit the total documents to be reviewed on the matter.


CAL turbocharges the speed and effectiveness of a document review. Not only does it prevent the review of multiple training models at the outset of a matter needed to train a traditional TAR model, it also can be used to continuously improve the number of relevant documents a reviewer sees in each batch of documents pulled during a review, which is particularly important during the critical early days of a matter. By continuously improving the precision of each batch a reviewer pulls to review, the model brings to light relevant documents early in the review and provides the legal team with the opportunity to act and make legal decisions based on the documents identified in the early stages of the document review. Further, dynamically batching documents in this way improves the efficiency of the document review team by presenting fewer nonresponsive documents within a review batch. Additionally, CAL models are not limited to relevance decisions. They can be created and applied to any document decision being used to classify documents in a review including case issues, compliance notations, hot or importance flags, etc.


Investigations can benefit greatly through the application of CAL models because of their ability to find the most relevant documents early in a review. Likewise, smaller document review efforts can particularly benefit through the use of CAL, because the upfront model training is not required to gain the benefits of a TAR model.


The document review process is time consuming, expensive, and requires a high degree of discerning accuracy. Using TAR models to improve on the costs associated with document review by eliminating large populations of irrelevant documents from having to be reviewed have made them an important component of a successful document review effort. CAL improves the traditional TAR application by eliminating the early sample training process and allowing the review to proceed as quickly as possible. CAL can also be applied to smaller matters to improve the speed and efficiency associated with smaller scale document reviews. When your next document review kicks off, look for a document review system that offers CAL and dynamic batching of review sets along with a team of trusted advisors to guide you through the process. Not only will your clients enjoy a lower invoice associated with the document review effort, but your legal decisions will be enhanced by having access to key documents early on in the review.

71 views0 comments

Comments


Post: Blog2_Post
bottom of page