{"id":1715,"date":"2019-06-24T02:39:02","date_gmt":"2019-06-24T02:39:02","guid":{"rendered":"http:\/\/blogs.dickinson.edu\/dcc\/?p=1715"},"modified":"2019-06-24T12:34:09","modified_gmt":"2019-06-24T12:34:09","slug":"concordance-liberation-terence","status":"publish","type":"post","link":"https:\/\/blogs.dickinson.edu\/dcc\/2019\/06\/24\/concordance-liberation-terence\/","title":{"rendered":"Concordance Liberation: Terence"},"content":{"rendered":"<div id=\"attachment_1730\" style=\"width: 362px\" class=\"wp-caption alignright\"><a href=\"https:\/\/www.metmuseum.org\/art\/collection\/search\/715048\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-1730\" class=\"size-full wp-image-1730\" src=\"http:\/\/blogs.dickinson.edu\/dcc\/files\/2019\/06\/Thalia.jpg\" alt=\"Scantily clad young woman holding theatrical mask.\" width=\"352\" height=\"600\" srcset=\"https:\/\/blogs.dickinson.edu\/dcc\/files\/2019\/06\/Thalia.jpg 352w, https:\/\/blogs.dickinson.edu\/dcc\/files\/2019\/06\/Thalia-176x300.jpg 176w\" sizes=\"auto, (max-width: 352px) 100vw, 352px\" \/><\/a><p id=\"caption-attachment-1730\" class=\"wp-caption-text\">Thalia, Muse of Comedy, from the Goddesses of the Greeks and Romans series (N188) issued by Wm. S. Kimball &amp; Co.,1889. Metropolitan Museum.<\/p><\/div>\n<p>The plays of <a href=\"https:\/\/en.wikipedia.org\/wiki\/Terence\">Terence<\/a> (P. Terentius Afer) are widely admired for their pure Latin style, but there is as yet no parsed text in digital form that would permit valid statistical analysis of his language and the creation of accurate vocabulary lists to ease reading via tools like <a href=\"https:\/\/bridge.haverford.edu\/\">The Bridge<\/a>. If and when DCC publishes an edition of a play of Terence, having a text in which each word form is associated with its correct dictionary headword (lemma) will make the creation of the vocabulary lists a relative snap. 
Computers can&#8217;t accurately parse texts on their own, but humans used to do it routinely in the genre of book known as the <a href=\"https:\/\/classicalstudies.org\/scs-blog\/christopher-francese\/blog-flight-concordances-resurrecting-classical-concordance-online\">concordance or index verborum<\/a>. With the help of Dickinson computer scientist Michael Skalak, Bret Mulligan of Haverford and I have been working on a <a href=\"https:\/\/gitclassical.github.io\/ConcordanceLiberation\/\">project<\/a> to convert older concordances and indices verborum into parsed texts by essentially unscrambling them so they are organized by text location rather than alphabetically by headword, and putting the data into an openly published and freely available spreadsheet. We have successfully completed the transformation of print concordances to <a href=\"https:\/\/github.com\/GitClassical\/ConcordanceLiberation\/tree\/master\/Concordances\/Lucretius\">Lucretius<\/a>, <a href=\"https:\/\/github.com\/GitClassical\/ConcordanceLiberation\/tree\/master\/Concordances\/Apuleius\">Apuleius<\/a>, and <a href=\"https:\/\/github.com\/GitClassical\/ConcordanceLiberation\/tree\/master\/Concordances\/Eutropius\">Eutropius<\/a>, and now we are on to Terence, based on a professionally digitized version of <em>Index Verborum Terentianus<\/em> by Edgar B. Jenkins (Chapel Hill: The University of North Carolina Press, 1932, Pp. ix +187). (<a href=\"https:\/\/www.worldcat.org\/title\/index-verborum-terentianus\/oclc\/907586121&amp;referer=brief_results\">Worldcat record<\/a>).<\/p>\n<p>Jenkins&#8217; book was meticulous, and it was well-received. Writing in <em>Classical Review<\/em> <a href=\"https:\/\/www.cambridge.org\/core\/journals\/classical-review\/article\/an-index-to-terence-index-verborum-terentianus-by-jenkinsedgar-b-phd-pp-ix-187-chapel-hill-the-university-of-north-carolina-press-1932-cloth-250\/FD3F75C608FEBD49E3EC91945DD8464E\">47.1 (1933) 22-23<\/a> J.D. 
Craig called it &#8220;a miracle of compression without obscurity,&#8221; and he spotted only a small number of errors. Jenkins based his index on the text of Kauer and Lindsay, which is still in use (and <a href=\"https:\/\/latin.packhum.org\/author\/134\">on PHI<\/a>). In each case, transformation from an alphabetical word list into a sequential parsed text requires careful examination of the system of listing lemmas, word forms, citations, and textual variants. Classical concordances are all slightly different in the conventions they employ.<\/p>\n<p>The main peculiarity of Jenkins&#8217; book is that he used a system of hyphenation, presumably to save space. This will have to be overcome by alteration of the base code for Michael Skalak&#8217;s Concordance Processor (<a href=\"https:\/\/github.com\/GitClassical\/ConcordanceLiberation\/tree\/master\">code on Github<\/a>). For my part, I had to filter out some information that was evidently important to Jenkins, but is not to us. For example, Jenkins put in parentheses all citations for words that are in parentheses in the text itself. Whether or not a word is in parentheses is immaterial to us, and leaving those citations in parentheses would have caused the processor to misinterpret them.<\/p>\n<p>For the benefit of anybody who wants to try to do this kind of work in the future (and there are innumerable concordances that could be liberated in this way), here are my working notes and analysis of Jenkins. 
A random chunk of the .pdf looks like this:<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-1717 size-full\" src=\"http:\/\/blogs.dickinson.edu\/dcc\/files\/2019\/06\/scirpus.png\" alt=\"selection from Terence concordance\" width=\"329\" height=\"671\" srcset=\"https:\/\/blogs.dickinson.edu\/dcc\/files\/2019\/06\/scirpus.png 329w, https:\/\/blogs.dickinson.edu\/dcc\/files\/2019\/06\/scirpus-147x300.png 147w\" sizes=\"auto, (max-width: 329px) 100vw, 329px\" \/>After digitization by <a href=\"https:\/\/www.newgen.co\/about-us.html\">NewGen Knowledge Works<\/a> it looks like this:<\/p>\n<p>&lt;il&gt;&lt;B&gt;scirp-us:&lt;\/B&gt;&lt;\/il&gt;<br \/>\n&lt;il&gt; -o (ab): An 941&lt;\/il&gt;<br \/>\n&lt;il&gt;&lt;B&gt;Scirt-us:&lt;\/B&gt;&lt;\/il&gt;<br \/>\n&lt;il&gt; -e: Hc 78&lt;\/il&gt;<br \/>\n&lt;il&gt;&lt;B&gt;sciscit-or:&lt;\/B&gt;&lt;\/il&gt;<br \/>\n&lt;il&gt; -ari: E 548&lt;\/il&gt;<br \/>\n&lt;il&gt;&lt;B&gt;scite&lt;\/B&gt; (3): Ht 729 764 785&lt;\/il&gt;<br \/>\n&lt;il&gt;&lt;B&gt;scit-us&lt;\/B&gt; (pa; 5):&lt;\/il&gt;<br \/>\n&lt;il&gt; -a (ns): P 110&lt;\/il&gt;<br \/>\n&lt;il&gt; -um (ac): E 254&lt;\/il&gt;<br \/>\n&lt;il&gt; -um (n): Ht 210; P 821&lt;\/il&gt;<br \/>\n&lt;il&gt; -us: An 486 (in tmesis w per)&lt;\/il&gt;<br \/>\n&lt;il&gt;&lt;B&gt;scopul-us:&lt;\/B&gt;&lt;\/il&gt;<br \/>\n&lt;il&gt; -um: P 689(4)&lt;\/il&gt;<br \/>\n&lt;il&gt;&lt;B&gt;scort-or&lt;\/B&gt; (2):&lt;\/il&gt;<br \/>\n&lt;il&gt; -ari: Ad 102; Ht 206&lt;\/il&gt;<br \/>\n&lt;il&gt; -atur: Ad 117(F)&lt;\/il&gt;<br \/>\n&lt;il&gt;&lt;B&gt;scortum&lt;\/B&gt; (ac; 2): Ad 965; E 424&lt;\/il&gt;<br \/>\n&lt;il&gt;&lt;B&gt;screatus&lt;\/B&gt; (ac): Ht 373&lt;\/il&gt;<br \/>\n&lt;il&gt;&lt;B&gt;scrib-o&lt;\/B&gt; (19):&lt;\/il&gt;<br \/>\n&lt;il&gt; -am (ind): P 127&lt;\/il&gt;<br \/>\n&lt;il&gt; -at: P 3&lt;\/il&gt;<br \/>\n&lt;il&gt; -endo (g ab): E 7&lt;\/il&gt;<br \/>\n&lt;il&gt; -endum (g): Ad 25; An 1&lt;\/il&gt;<br \/>\n&lt;il&gt; -ere: Ad 16; E 36; 
Hc 56&lt;\/il&gt;<br \/>\n&lt;il&gt; -eret: Hc 27&lt;\/il&gt;<br \/>\n&lt;il&gt; -ito (3): P 668&lt;\/il&gt;<br \/>\n&lt;il&gt; -undis (ab): An 5&lt;\/il&gt;<br \/>\n&lt;il&gt; -unt: Ht 43&lt;\/il&gt;<br \/>\n&lt;il&gt; scripserit (subj): Hc 7a(DT); Ht 7&lt;\/il&gt;<br \/>\n&lt;il&gt; scripsit: E 10; Hc 6; Ht 15; P 6&lt;\/il&gt;<br \/>\n&lt;il&gt; scripta (sunt): An 283&lt;\/il&gt;<br \/>\n&lt;il&gt; scriptam (sc esse): P 329&lt;\/il&gt;<\/p>\n<p>Skalak&#8217;s concordance processor will convert this into a spreadsheet with each piece of information in its proper category: lemma or headword (column 1), lemma homonym distinguisher, if any (column 2), citation for specific word forms (column 3), the word forms (column 4), word form homonym distinguishers or other information about a single word form (column 5), and textual variant information (column 6). The trick to the pre-processing analysis is to find the machine-readable characteristics of each kind of information, so the processor can be adjusted to the specific conventions used by the index. Examination revealed the following:<\/p>\n<ul>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Lemmas <\/span><b>[column 1] <\/b><span style=\"font-weight: 400\">are introduced by &lt;il&gt;&lt;B&gt; and terminated by a colon. The closing &lt;\/B&gt; tag may follow or precede the colon, but it will always be there. Only lemmas are enclosed with &lt;B&gt;&#8230;&lt;\/B&gt; tags. 
The colon is followed by &lt;\/il&gt;, &lt;\/B&gt;&lt;\/il&gt;, or by one or more citations and &lt;\/il&gt;. Examples:<\/span>\n<ul>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">&lt;il&gt;&lt;B&gt;<\/span><span style=\"font-weight: 400\">abrad-o<\/span><span style=\"font-weight: 400\">:&lt;\/B&gt;&lt;\/il&gt;<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">&lt;il&gt;&lt;B&gt;<\/span><span style=\"font-weight: 400\">a<\/span><span style=\"font-weight: 400\">&lt;\/B&gt; (prep; 87):&lt;\/il&gt;<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">&lt;il&gt;&lt;B&gt;<\/span><span style=\"font-weight: 400\">accurate<\/span><span style=\"font-weight: 400\">:&lt;\/B&gt; An 494&lt;\/il&gt;<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">&lt;il&gt;&lt;B&gt;<\/span><span style=\"font-weight: 400\">abhinc<\/span><span style=\"font-weight: 400\">&lt;\/B&gt; (3): An 69; Hc 822; P 1017&lt;\/il&gt;<\/span><\/li>\n<\/ul>\n<\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Lemma distinguishers<\/span> <b>[column 2]<\/b><span style=\"font-weight: 400\"> sometimes precede (but never follow) the colon, and are in parentheses. They give the number of times that the lemma occurs, or homonym distinguishers, or textual information, or some combination of the three, set apart with semicola. 
This info needs to go in column 2 next to every word form under that lemma.<\/span>\n<ul>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">&lt;il&gt;&lt;B&gt;ac-er&lt;\/B&gt; <\/span><span style=\"font-weight: 400\">(2)<\/span><span style=\"font-weight: 400\">:&lt;\/il&gt;<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">&lt;il&gt;&lt;B&gt;act-us&lt;\/B&gt; <\/span><span style=\"font-weight: 400\">(subs)<\/span><span style=\"font-weight: 400\">:&lt;\/il&gt;<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">&lt;il&gt;&lt;B&gt;ad-eo&lt;\/B&gt; <\/span><span style=\"font-weight: 400\">(verb; 26)<\/span><span style=\"font-weight: 400\">:&lt;\/il&gt;<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">&lt;il&gt;&lt;B&gt;dehinc&lt;\/B&gt; <\/span><span style=\"font-weight: 400\">(de(h)inc=KL; 8)<\/span><span style=\"font-weight: 400\">: Ad 22; An 22 79(dein=4) 190 562 (dein=4); E 14 296 872&lt;\/il&gt;<\/span><\/li>\n<\/ul>\n<\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Word forms<\/span> <b>[column 4] <\/b><span style=\"font-weight: 400\">sometimes directly follow the lemma after the colon and before the closing &lt;\/il&gt; tag (as just above). But in most cases they are listed on a new line, preceded by &lt;il&gt; and a tab, and followed by a colon. 
<\/span>\n<ul>\n<li><span style=\"font-weight: 400\">&lt;il&gt;&lt;B&gt;depecto:&lt;\/B&gt;&lt;\/il&gt;<\/span><\/li>\n<li><span style=\"font-weight: 400\">&lt;il&gt;<\/span> <span style=\"font-weight: 400\">depexum (ac)<\/span><span style=\"font-weight: 400\">: Ht 951&lt;\/il&gt;<\/span><\/li>\n<li><span style=\"font-weight: 400\">&lt;il&gt;&lt;B&gt;deper-eo:&lt;\/B&gt;&lt;\/il&gt;<\/span><\/li>\n<li><span style=\"font-weight: 400\">&lt;il&gt;<\/span> <span style=\"font-weight: 400\">-it<\/span><span style=\"font-weight: 400\">: Ht 525&lt;\/il&gt;<\/span><\/li>\n<li><span style=\"font-weight: 400\">&lt;il&gt;&lt;B&gt;delir-o&lt;\/B&gt; (5):&lt;\/il&gt;<\/span><\/li>\n<li><span style=\"font-weight: 400\">&lt;il&gt;<\/span> <span style=\"font-weight: 400\">-ans (n)<\/span><span style=\"font-weight: 400\">: Ad 761&lt;\/il&gt;<\/span><\/li>\n<li><span style=\"font-weight: 400\">&lt;il&gt;<\/span> <span style=\"font-weight: 400\">-as<\/span><span style=\"font-weight: 400\">: Ad 936; An 752; P 801&lt;\/il&gt;<\/span><\/li>\n<li><span style=\"font-weight: 400\">&lt;il&gt;<\/span> <span style=\"font-weight: 400\">-at<\/span><span style=\"font-weight: 400\">: P 997&lt;\/il&gt;<\/span><\/li>\n<\/ul>\n<\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Word form modifiers <\/span><b>[column 5] <\/b><span style=\"font-weight: 400\">sometimes follow the word form in parentheses, before the colon. This information can be syntactical (most common) or textual, can indicate matter to be assumed, differentiate homonyms, or indicate frequency. Put this in column 5 next to every instance of the word form. 
<\/span>\n<ul>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">&lt;il&gt;<\/span> <span style=\"font-weight: 400\">aspexerit (subj): Ht 773&lt;\/il&gt;<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">&lt;il&gt;<\/span> <span style=\"font-weight: 400\">-andus (est): Ad 709&lt;\/il&gt;<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">&lt;il&gt;ante (adv; 6): An 239 556; E 733; Hc 146 581; P 4(antehac=DU)&lt;\/il&gt;<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">&lt;il&gt;<\/span> <span style=\"font-weight: 400\">-quid (-quit=U sometimes; ac): Ad 38 150 401 518 856 857 948 980; An 250 259 265(om=DU) 615 622 640; E 210 308 661 999 1001; Hc 333; Ht 69 339 533 670 763 1003; P 42 190 770 874&lt;\/il&gt;<\/span><\/li>\n<\/ul>\n<\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Citations for instances of a word form<\/span> <b>[column 3]<\/b><span style=\"font-weight: 400\"> in each of the six plays follow the colon. Semicola separate instances for each play. Multiple citations from a single play are separated by a space only. 
&lt;\/il&gt; closes off the word form.<\/span><\/li>\n<li><span style=\"font-weight: 400\">&lt;il&gt;&lt;B&gt;de-us&lt;\/B&gt; (121):&lt;\/il&gt;<\/span><\/li>\n<li><span style=\"font-weight: 400\">&lt;il&gt;<\/span> <span style=\"font-weight: 400\">-o (ab): <\/span><span style=\"font-weight: 400\">P 74<\/span><span style=\"font-weight: 400\">&lt;\/il&gt;<\/span><\/li>\n<li><span style=\"font-weight: 400\">&lt;il&gt;<\/span> <span style=\"font-weight: 400\">-orum: <\/span><span style=\"font-weight: 400\">An 959(sp=U); Ht 693<\/span><span style=\"font-weight: 400\">&lt;\/il&gt;<\/span><\/li>\n<li><span style=\"font-weight: 400\">&lt;il&gt;<\/span> <span style=\"font-weight: 400\">-os: <\/span><span style=\"font-weight: 400\">Ad 275 298 491 693 699 704; An 487 522 538 664 694 834; Hc 476 772 772; Ht 879 1038; P 311 764<\/span><span style=\"font-weight: 400\">&lt;\/il&gt;<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">The string &#8220;<\/span><span style=\"font-weight: 400\">ae<\/span><span style=\"font-weight: 400\">&#8221; followed directly by numerals (no spaces) should be treated as part of the numeral. This indicates the line numbers in the alternate ending of the Andria. Some line numbers will have a letter suffix, like 7a, 7b.<\/span>\n<ul>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">&lt;il&gt;<\/span> <span style=\"font-weight: 400\">-averis (ind): An <\/span><span style=\"font-weight: 400\">ae16<\/span><span style=\"font-weight: 400\">&lt;\/il&gt;<\/span><\/li>\n<\/ul>\n<\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Citation modifiers in parentheses <\/span><b>[column 6]<\/b><span style=\"font-weight: 400\">. These are all textual variants. 
Depending on what it says, sometimes <\/span><span style=\"font-weight: 400\">the parenthetical material only will be deleted<\/span><span style=\"font-weight: 400\">, sometimes <\/span><span style=\"font-weight: 400\">the citation will be deleted as well<\/span><span style=\"font-weight: 400\">. This can be done after the creation of the spreadsheet. If the citation-distinguishing parenthesis in column 6 contains &#8216;=&#8217;, delete just the parenthesis. If it does not contain &#8216;=&#8217;, delete the entire citation and the parenthesis. Column 6 will then be gone.<\/span>\n<ul>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">&lt;il&gt;&lt;B&gt;ergo&lt;\/B&gt; (38): Ad <\/span><span style=\"font-weight: 400\">172(ego me=F)<\/span><span style=\"font-weight: 400\"> 324 <\/span><span style=\"font-weight: 400\">325(FT)<\/span><span style=\"font-weight: 400\"> 326 572 609 854 959; An 195 565 711 850; E 162 317 401 459 796 1062; Hc 63 610 611 715 <\/span><span style=\"font-weight: 400\">787(4)<\/span><span style=\"font-weight: 400\">; Ht 398 550 821 985 <\/span><span style=\"font-weight: 400\">993 (ego=FU)<\/span><span style=\"font-weight: 400\"> 1046; P 62 202 539 562 685 718 755 882 948 984 995&lt;\/il&gt;<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">&lt;il&gt;&lt;B&gt;et&lt;\/B&gt; (538): Ad 2 19 30 34 <\/span><span style=\"font-weight: 400\">35(om=4)<\/span><span style=\"font-weight: 400\"> 43 57 64 65 68 78 107 121 <\/span><span style=\"font-weight: 400\">121(F)<\/span><span style=\"font-weight: 400\"> 122 129 138 144 207 230 251 263 272 <\/span><span style=\"font-weight: 400\">279(F)<\/span><span style=\"font-weight: 400\"> 285 285 305 316 319 340 352 380 <\/span><span style=\"font-weight: 400\">389 (U)<\/span><span style=\"font-weight: 400\"> 391 423 429 446 495 511 521 523 558 566 580 <\/span><span style=\"font-weight: 400\">584(ei=F)<\/span><span style=\"font-weight: 400\"> 591 596 <\/span><span 
style=\"font-weight: 400\">600(esse=FTU)<\/span><span style=\"font-weight: 400\"> 602 603 609 609 648 675 680 683 <\/span><span style=\"font-weight: 400\">692 (4)<\/span><\/li>\n<\/ul>\n<\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Lemmas sometimes have hyphens to indicate that subsequent inflected forms may be abbreviated. They may or may not actually be abbreviated. Word forms can be reconstructed by combining.<\/span>\n<ul>\n<li><span style=\"font-weight: 400\">&lt;il&gt;&lt;B&gt;adfer-o&lt;\/B&gt; (25):&lt;\/il&gt;<\/span><\/li>\n<li><span style=\"font-weight: 400\">&lt;il&gt;<\/span> <span style=\"font-weight: 400\">-: Ht 223&lt;\/il&gt;<\/span><\/li>\n<li><span style=\"font-weight: 400\">&lt;il&gt;<\/span> <span style=\"font-weight: 400\">-am (ind): Ht 701&lt;\/il&gt;<\/span><\/li>\n<li><span style=\"font-weight: 400\">&lt;il&gt;<\/span> <span style=\"font-weight: 400\">-ant: Ad 300&lt;\/il&gt;<\/span><\/li>\n<li><span style=\"font-weight: 400\">&lt;il&gt;&lt;B&gt;admitt-o&lt;\/B&gt; (13):&lt;\/il&gt;<\/span><\/li>\n<li><span style=\"font-weight: 400\">&lt;il&gt;<\/span> <span style=\"font-weight: 400\">admiserit (subj): P 270&lt;\/il&gt;<\/span><\/li>\n<li><span style=\"font-weight: 400\">&lt;il&gt;<\/span> <span style=\"font-weight: 400\">admisero: E 853&lt;\/il&gt;<\/span><\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>I made some alterations to the concordance to make it easier to process:<\/p>\n<ul>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">To avoid confusion, citations that are themselves in parentheses had to be removed from parentheses. 
Otherwise they will be treated as supplementary info for the previous citation.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Alphabetic headings had to be removed, since they looked superficially like lemmas.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">Spurious lines in square brackets were removed.<\/span><\/li>\n<li style=\"font-weight: 400\"><span style=\"font-weight: 400\">All 59 instances of &#8220;*&#8221; were removed. The asterisk indicates that some minor point applies, e.g. that <em>est<\/em> is to be inferred with <em>factum<\/em>, or that <em>ipsa<\/em> is spelled <em>eapse<\/em> in F&#8217;s edition. This information was not significant enough for our purpose, which was to get each word form sitting next to its proper lemma.<\/span><\/li>\n<\/ul>\n<p>After the spreadsheet is done I&#8217;ll check it, then hand it over to Bret Mulligan, who will ingest the parsed text into the Bridge, adding Bridge display lemmas and definitions. Custom vocabulary lists can be created from there. The original .txt and the spreadsheet version will also be made available on our <a href=\"https:\/\/github.com\/GitClassical\/ConcordanceLiberation\/tree\/master\">Github repository<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The plays of Terence (P. 
Terentius Afer) are widely admired for their pure Latin style, but there is as yet no parsed text in digital form that would permit valid statistical analysis of his language and the creation of accurate &hellip; <a href=\"https:\/\/blogs.dickinson.edu\/dcc\/2019\/06\/24\/concordance-liberation-terence\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":65,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"ngg_post_thumbnail":0,"footnotes":""},"categories":[153085,1],"tags":[61802],"class_list":["post-1715","post","type-post","status-publish","format-standard","hentry","category-concordance-liberation","category-uncategorized","tag-terence"],"_links":{"self":[{"href":"https:\/\/blogs.dickinson.edu\/dcc\/wp-json\/wp\/v2\/posts\/1715","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/blogs.dickinson.edu\/dcc\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blogs.dickinson.edu\/dcc\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blogs.dickinson.edu\/dcc\/wp-json\/wp\/v2\/users\/65"}],"replies":[{"embeddable":true,"href":"https:\/\/blogs.dickinson.edu\/dcc\/wp-json\/wp\/v2\/comments?post=1715"}],"version-history":[{"count":0,"href":"https:\/\/blogs.dickinson.edu\/dcc\/wp-json\/wp\/v2\/posts\/1715\/revisions"}],"wp:attachment":[{"href":"https:\/\/blogs.dickinson.edu\/dcc\/wp-json\/wp\/v2\/media?parent=1715"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blogs.dickinson.edu\/dcc\/wp-json\/wp\/v2\/categories?post=1715"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blogs.dickinson.edu\/dcc\/wp-json\/wp\/v2\/tags?post=1715"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}