<!DOCTYPE html> <!-- | Generated by Apache Maven Doxia Site Renderer 1.7.4 at 2017-12-19 | Rendered using Apache Maven Fluido Skin 1.5 --> <html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en"> <head> <meta charset="UTF-8" /> <meta name="viewport" content="width=device-width, initial-scale=1.0" /> <meta name="Date-Revision-yyyymmdd" content="20171219" /> <meta http-equiv="Content-Language" content="en" /> <title>Multi Tier Annotation Search – Mapping</title> <link rel="stylesheet" href="./css/apache-maven-fluido-1.5.min.css" /> <link rel="stylesheet" href="./css/site.css" /> <link rel="stylesheet" href="./css/print.css" media="print" /> <script type="text/javascript" src="./js/apache-maven-fluido-1.5.min.js"></script> </head> <body class="topBarDisabled"> <div class="container-fluid"> <div id="banner"> <div class="pull-left"> <div id="bannerLeft"> <h2>MTAS</h2> </div> </div> <div class="pull-right"> <a href="http://www.meertens.knaw.nl/" id="bannerRight"> <img src="images/meertens.png" alt="Meertens Instituut" width="93" height="104"/> </a> </div> <div class="clear"><hr/></div> </div> <div id="breadcrumbs"> <ul class="breadcrumb"> <li id="projectVersion">Version: 7.1.0.5-SNAPSHOT <span class="divider">|</span> </li> <li class=""> <a href="index.html" title="Mtas"> Mtas</a> <span class="divider">/</span> </li> <li class="active ">Mapping</li> <li id="publishDate" class="pull-right">Last Published: 2017-12-19</li> </ul> </div> <div class="row-fluid"> <div id="leftColumn" class="span2"> <div class="well sidebar-nav"> <ul class="nav nav-list"> <li class="nav-header">Mtas</li> <li> <a href="index.html" title="Introduction"> <span class="none"></span> Introduction</a> </li> <li> <a href="features.html" title="Features"> <span class="none"></span> Features</a> </li> <li> <a href="installation.html" title="Getting started"> <span class="icon-chevron-right"></span> Getting started</a> </li> <li> <a href="indexing.html" title="Indexing"> <span 
class="icon-chevron-down"></span> Indexing</a> <ul class="nav nav-list"> <li> <a href="indexing_configuration.html" title="Configuration"> <span class="none"></span> Configuration</a> </li> <li class="active"> <a href="#"><span class="none"></span>Mapping</a> </li> <li> <a href="indexing_formats.html" title="Formats"> <span class="icon-chevron-down"></span> Formats</a> <ul class="nav nav-list"> <li> <a href="indexing_formats_folia.html" title="FoLiA"> <span class="none"></span> FoLiA</a> </li> <li> <a href="indexing_formats_tei.html" title="TEI"> <span class="none"></span> TEI</a> </li> <li> <a href="indexing_formats_chat.html" title="CHAT"> <span class="none"></span> CHAT</a> </li> <li> <a href="indexing_formats_sketch.html" title="Sketch"> <span class="none"></span> Sketch</a> </li> <li> <a href="indexing_formats_crm.html" title="CRM"> <span class="none"></span> CRM</a> </li> </ul> </li> </ul> </li> <li> <a href="search.html" title="Search"> <span class="icon-chevron-right"></span> Search</a> </li> <li> <a href="download.html" title="Download"> <span class="none"></span> Download</a> </li> <li class="nav-header">Project Documentation</li> <li> <a href="project-info.html" title="Project Information"> <span class="icon-chevron-right"></span> Project Information</a> </li> <li> <a href="project-reports.html" title="Project Reports"> <span class="icon-chevron-right"></span> Project Reports</a> </li> </ul> <hr /> <div id="poweredBy"> <div class="clear"></div> <div class="clear"></div> <div class="clear"></div> <div class="clear"></div> <a href="http://maven.apache.org/" title="Built by Maven" class="poweredBy"> <img class="builtBy" alt="Built by Maven" src="./images/logos/maven-feather.png" /> </a> </div> </div> </div> <div id="bodyColumn" class="span10" > <h1>Mapping</h1> <p>Illustration of a possible mapping from the resource sample listed <a href="indexing_configuration.html">before</a> to the index structure. 
</p> <p>The <i>RealOffset</i> is the real position of the element in the resource; the <i>Offset</i> is that position adjusted to meet the Lucene requirement that all elements with the same starting position have the same starting offset. </p> <table border="0" class="table table-striped"> <thead> <tr class="a"> <th>Id</th> <th>RealOffset</th> <th>Offset</th> <th>Position</th> <th>Parent</th> <th>Payload</th> <th>Prefix</th> <th>Postfix</th> </tr> </thead> <tbody> <tr class="b"> <td>643</td> <td>1985-40696</td> <td>2154-39638</td> <td>0</td> <td colspan="2">151</td> <td>div</td> <td>chapter</td> </tr> <tr class="a"> <td>5</td> <td>2049-2377</td> <td>2154-2349</td> <td>0</td> <td colspan="2">643</td> <td colspan="2">head</td> </tr> <tr class="b"> <td>3</td> <td>2298-2313</td> <td>2154-2349</td> <td>0</td> <td colspan="2">4</td> <td>lemma</td> <td>stemma</td> </tr> <tr class="a"> <td>2</td> <td>2242-2257</td> <td>2154-2349</td> <td>0</td> <td colspan="2">4</td> <td>pos</td> <td>N(soort,ev,basis,onz,stan)</td> </tr> <tr class="b"> <td>4</td> <td>2100-2364</td> <td>2154-2349</td> <td>0</td> <td colspan="2">5</td> <td colspan="2">s</td> </tr> <tr class="a"> <td>0</td> <td>2214-2238</td> <td>2154-2349</td> <td>0</td> <td colspan="2">4</td> <td>t</td> <td>Stemma</td> </tr> <tr class="b"> <td>1</td> <td>2214-2238</td> <td>2154-2349</td> <td>0</td> <td colspan="2">4</td> <td>t_lc</td> <td>stemma</td> </tr> <tr class="a"> <td>41</td> <td>5372-5734</td> <td>2503-2875</td> <td>1-2</td> <td colspan="2">57</td> <td>dependency</td> <td>su</td> </tr> <tr class="b"> <td>40</td> <td>5583-5711</td> <td>2503-2698</td> <td>1</td> <td colspan="2">41</td> <td colspan="2">dependency.dep</td> </tr> <tr class="a"> <td>9</td> <td>2646-2661</td> <td>2503-2698</td> <td>1</td> <td colspan="2">57</td> <td>lemma</td> <td>Stemma</td> </tr> <tr class="b"> <td>642</td> <td>2384-39666</td> <td>2503-39638</td> <td>1-151</td> <td colspan="2">643</td> <td>p</td> <td>firstparagraph</td> </tr> <tr 
class="a"> <td>8</td> <td>2588-2603</td> <td>2503-2698</td> <td>1</td> <td colspan="2">57</td> <td>pos</td> <td>N(eigen,ev,basis,zijd,stan)</td> </tr> <tr class="b"> <td>57</td> <td>2452-9269</td> <td>2503-4087</td> <td>1-8</td> <td colspan="2">642</td> <td colspan="2">s</td> </tr> <tr class="a"> <td>6</td> <td>2560-2584</td> <td>2503-2698</td> <td>1</td> <td colspan="2">57</td> <td>t</td> <td>Stemma</td> </tr> <tr class="b"> <td>7</td> <td>2560-2584</td> <td>2503-2698</td> <td>1</td> <td colspan="2">57</td> <td>t_lc</td> <td>stemma</td> </tr> <tr class="a"> <td>44</td> <td>5747-6111</td> <td>2702-3444</td> <td>2,5</td> <td colspan="2">57</td> <td>dependency</td> <td>predc</td> </tr> <tr class="b"> <td>39</td> <td>5455-5578</td> <td>2702-2875</td> <td>2</td> <td colspan="2">41</td> <td colspan="2">dependency.hd</td> </tr> <tr class="a"> <td>42</td> <td>5833-5956</td> <td>2702-2875</td> <td>2</td> <td colspan="2">44</td> <td colspan="2">dependency.hd</td> </tr> <tr class="b"> <td>13</td> <td>2826-2841</td> <td>2702-2875</td> <td>2</td> <td colspan="2">57</td> <td>lemma</td> <td>zijn</td> </tr> <tr class="a"> <td>12</td> <td>2783-2798</td> <td>2702-2875</td> <td>2</td> <td colspan="2">57</td> <td>pos</td> <td>WW(pv,tgw,ev)</td> </tr> <tr class="b"> <td>10</td> <td>2759-2779</td> <td>2702-2875</td> <td>2</td> <td colspan="2">57</td> <td>t</td> <td>is</td> </tr> <tr class="a"> <td>11</td> <td>2759-2779</td> <td>2702-2875</td> <td>2</td> <td colspan="2">57</td> <td>t_lc</td> <td>is</td> </tr> <tr class="b"> <td>…</td> <td>…</td> <td>…</td> <td>…</td> <td>…</td> <td>…</td> <td>…</td> <td>…</td> </tr> </tbody> </table> <p>The payload in the table above, although never set in this example, is not the regular payload known from Lucene. All additional elements, such as the RealOffset, the range or set of positions, and the reference to a parent element, are encoded together with this optional Mtas-specific payload into a classic Lucene payload. 
This enables future use of, for example, confidence levels on annotations.</p> </div> </div> </div> <hr/> <footer> <div class="container-fluid"> <div class="row-fluid"> <p >Copyright © 2017 <a href="http://www.meertens.knaw.nl/">Meertens Institute</a>. All rights reserved. </p> </div> </div> </footer> </body> </html>