R version 2.15.2 (2012-10-26) -- "Trick or Treat"
Copyright (C) 2012 The R Foundation for Statistical Computing
ISBN 3-900051-07-0
Platform: x86_64-apple-darwin9.8.0/x86_64 (64-bit)

R is free software and comes with ABSOLUTELY NO WARRANTY.
You are welcome to redistribute it under certain conditions.
Type 'license()' or 'licence()' for distribution details.

  Natural language support but running in an English locale

R is a collaborative project with many contributors.
Type 'contributors()' for more information and
'citation()' on how to cite R or R packages in publications.

Type 'demo()' for some demos, 'help()' for on-line help, or
'help.start()' for an HTML browser interface to help.
Type 'q()' to quit R.

[R.app GUI 1.53 (6335) x86_64-apple-darwin9.8.0]

[Workspace restored from /Users/Erik/.RData]
[History restored from /Users/Erik/.Rapp.history]

> source("/Users/Erik/Downloads/Ted_Underwood_Files_for_R/NassrProgram.R")
You'll need to find the necessary data files on your own computer.
Ready to select NassrMetadata? yes
Ready to select NassrData?
Unpacking the data. This may take five to ten minutes.
Here are a few of the most common authors in this dataset.
Authors
      MoreH  HawthorneN      ScottW     CooperJ     HayleyW   SedgwickC   HolcroftT   Anonymous
         55          33          32          22          20          20          18          14
     PrattM      ByronG    DickensC  EdgeworthM  ThackerayW     DibdinC   InchbaldM     LennoxC
         14          13          13          13          11          10          10          10
    AustenJ JerninghamM     IrvingW  RadcliffeA
          9           9           8           8
When prompted for an author name you can say "authors" to repeat that list.

For the corpus to be characterized:
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: EdgeworthM
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all
For the reference corpus (in terms of which it will be characterized):
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: all
Date range.
Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

WORDS OVERREPRESENTED BY LOG-LIKELIHOOD
1 harry 10098 0.693
2 lucy 7956 0.676
3 lady 5269 0.837
4 ormond 5081 0.598
5 percy 4819 0.49
6 belinda 4383 0.523
7 hervey 3141 0.527
8 falconer 2813 0.519
9 said 2783 0.852
10 leonora 2591 0.634
11 you 2467 0.868
12 caroline 2427 0.552
13 clarence 2349 0.519
14 lord 2033 0.496
15 helen 1930 0.555
16 portman 1834 0.522
17 rosamond 1792 0.52
18 rupert 1641 0.638
19 ladyship 1637 0.824
20 alfred 1458 0.53
21 commissioner 1451 0.508
22 count 1367 0.577
23 she 1099 0.79
24 percival 1016 0.524
25 vincent 988 0.548
26 cecilia 883 0.583
27 thing 811 0.835
28 not 802 0.853
29 dear 792 0.88
30 could 754 0.913
31 virginia 732 0.492
32 sir 715 0.7
33 godfrey 677 0.513
34 mamma 668 0.735
35 ireland 666 0.691
36 any 632 0.85
37 miss 627 0.767
38 olivia 612 0.566
39 lordship 606 0.6
40 do 586 0.884
41 steam 559 0.584
42 it 531 0.807
43 know 523 0.916
44 that 514 0.835
45 am 491 0.82
46 whilst 419 0.744
47 barclay 406 0.517
48 pump 388 0.602
49 all 387 0.77
50 be 383 0.824

WORDS OVERREPRESENTED BY MANN-WHITNEY RHO
1 understand 0.937 271
2 recollect 0.923 309
3 talking 0.916 127
4 know 0.916 523
5 could 0.913 754
6 provoking 0.912 41.9
7 nonsense 0.911 62.3
8 perfectly 0.905 119
9 explain 0.903 192
10 continually 0.889 95.4
11 tired 0.888 76
12 going 0.888 205
13 do 0.884 586
14 dear 0.88 792
15 sorry 0.879 79.5
16 satisfied 0.879 93.8
17 yesterday 0.879 48.9
18 liked 0.875 48.1
19 spoiled 0.874 19.6
20 directly 0.869 77.2
21 quite 0.869 136
22 please 0.868 182
23 you 0.868 2467
24 repeated 0.868 233
25 decide 0.866 101
26 afraid 0.864 148
27 repeating 0.862 52.7
28 thank 0.862 115
29 manage 0.86 44
30 guess 0.86 97.8
31 sure 0.859 290
32 ashamed 0.857 35.4
33 put 0.856 140
34 admiration 0.855 90.5
35 disappointed 0.855 44.8
36 surprised 0.855 75.6
37 tiresome 0.853 37.2
38 especially 0.853 76.3
39 not 0.853 802
40 reading 0.853 80.1
41 dressing 0.852 9.04
42 said 0.852 2783
43 formerly 0.851 50
44 understanding 0.851 103
45 possible 0.85 157
46 because 0.85 261
47 really 0.85 125
48 any 0.85 632
49 saw 0.85 183
50 think 0.85 173

Hit return to continue, say "quit" to quit, or say "corpus" to get a list of the documents you just characterized:

For the corpus to be characterized:
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: LennoxC
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all
For the reference corpus (in terms of which it will be characterized):
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: all
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

WORDS OVERREPRESENTED BY LOG-LIKELIHOOD
1 mr 2961 0.943
2 my 2019 0.743
3 howard 1837 0.716
4 harley 1787 0.629
5 greville 1495 0.641
6 mrs 1483 0.932
7 neville 945 0.639
8 fanny 894 0.815
9 me 870 0.799
10 aubrey 821 0.591
11 sophia 663 0.725
12 lady 564 0.88
13 wilmot 556 0.831
14 extremely 548 0.984
15 her 534 0.826
16 seymour 510 0.679
17 uncle 503 0.767
18 hero 444 0.523
19 wholly 421 0.961
20 behaviour 419 0.954
21 our 419 0.794
22 this 367 0.904
23 which 356 0.749
24 amiable 353 0.959
25 to 326 0.86
26 letter 312 0.916
27 madam 306 0.923
28 benson 271 0.636
29 myself 263 0.827
30 situation 260 0.924
31 conversation 258 0.957
32 george 258 0.492
33 us 255 0.734
34 maria 235 0.62
35 informed 232 0.958
36 miss 224 0.893
37 regard 212 0.812
38 instantly 209 0.927
39 company 196 0.896
40 conduct 195 0.929
41 satisfaction 187 0.967
42 family 185 0.895
43 bradshaw 181 0.69
44 louisa 178 0.76
45 however 176 0.81
46 she 174 0.814
47 uneasiness 171 0.889
48 dear 166 0.779
49 so 155 0.752
50 convinced 144 0.837

WORDS OVERREPRESENTED BY MANN-WHITNEY RHO
1 extremely 0.984 548
2 civility 0.97 117
3 apprehensive 0.97 96.4
4 satisfaction 0.967 187
5 warmest 0.961 80.3
6 wholly 0.961 421
7 amiable 0.959 353
8 accompany 0.959 55.8
9 informed 0.958 232
10 relating 0.957 83.3
11 conversation 0.957 258
12 behaviour 0.954 419
13 mortified 0.949 34.6
14 mortification 0.948 113
15 received 0.945 119
16 mr 0.943 2961
17 opportunity 0.942 76.5
18 amusements 0.939 32.3
19 entreaties 0.937 54.9
20 apprehensions 0.937 89.4
21 attentions 0.936 70.9
22 consequences 0.933 44.2
23 degree 0.933 99.7
24 mrs 0.932 1483
25 scheme 0.932 55
26 imagine 0.93 114
27 conduct 0.929 195
28 insisted 0.928 80.6
29 instantly 0.927 209
30 countenance 0.925 123
31 situation 0.924 260
32 madam 0.923 306
33 visit 0.923 107
34 ladies 0.922 114
35 arrival 0.922 83.5
36 acknowledged 0.92 53
37 reception 0.92 46.8
38 circumstance 0.919 98.7
39 emotions 0.918 50.8
40 concluded 0.917 115
41 relations 0.917 84.3
42 letter 0.916 312
43 politeness 0.916 110
44 shocked 0.914 89.2
45 accident 0.913 74.1
46 inform 0.913 74.8
47 acquaintance 0.912 131
48 afforded 0.912 50.2
49 sentiments 0.91 83.9
50 ordered 0.91 66.6

Hit return to continue, say "quit" to quit, or say "corpus" to get a list of the documents you just characterized:

For the corpus to be characterized:
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: ScottW,fic
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all
For the reference corpus (in terms of which it will be characterized):
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: all
Date range.
Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

WORDS OVERREPRESENTED BY LOG-LIKELIHOOD
1 which 4162 0.913
2 said 4137 0.927
3 wad 4100 0.691
4 abbot 3697 0.75
5 woodstock 3620 0.571
6 ye 3003 0.697
7 hae 2589 0.748
8 of 2555 0.735
9 answered 2519 0.958
10 everard 1858 0.577
11 knight 1698 0.859
12 morton 1631 0.584
13 weel 1605 0.748
14 roland 1601 0.604
15 thou 1569 0.663
16 wi 1553 0.78
17 betwixt 1528 0.958
18 king 1413 0.78
19 sae 1348 0.768
20 master 1346 0.823
21 scotland 1335 0.942
22 your 1308 0.818
23 castle 1258 0.862
24 ay 1182 0.929
25 ken 1148 0.794
26 ashton 1129 0.513
27 bertram 1114 0.53
28 nae 1114 0.727
29 duke 1108 0.756
30 folk 1096 0.797
31 laird 1074 0.851
32 louis 1037 0.644
33 scottish 1035 0.947
34 lovel 1017 0.512
35 apartment 1001 0.939
36 highland 952 0.789
37 glover 945 0.598
38 catharine 929 0.541
39 leicester 915 0.553
40 ain 890 0.75
41 mair 886 0.792
42 earl 878 0.837
43 mr 866 0.645
44 bailie 860 0.778
45 mac 845 0.656
46 ane 836 0.781
47 alan 831 0.511
48 mowbray 823 0.526
49 tyrrel 822 0.537
50 antiquary 790 0.775

WORDS OVERREPRESENTED BY MANN-WHITNEY RHO
1 answered 0.958 2519
2 betwixt 0.958 1528
3 scottish 0.947 1035
4 warrant 0.944 501
5 purpose 0.944 725
6 scotland 0.942 1335
7 apartment 0.939 1001
8 risk 0.93 263
9 ay 0.929 1182
10 said 0.927 4137
11 excepting 0.927 345
12 termed 0.918 161
13 permit 0.914 247
14 trusty 0.913 169
15 which 0.913 4162
16 farther 0.909 572
17 wench 0.907 212
18 ordinary 0.907 223
19 weapon 0.905 235
20 ale 0.903 222
21 formidable 0.903 147
22 boot 0.902 127
23 followers 0.898 505
24 personal 0.897 185
25 safety 0.897 365
26 partly 0.897 239
27 domestics 0.897 122
28 desirous 0.896 215
29 edinburgh 0.895 587
30 commanded 0.895 222
31 hastily 0.894 233
32 courtesy 0.894 262
33 quarrel 0.893 183
34 kinsman 0.892 432
35 assistance 0.892 248
36 otherwise 0.892 150
37 saddle 0.891 109
38 authority 0.891 253
39 trow 0.89 158
40 assuming 0.89 53.6
41 assumed 0.89 124
42 tone 0.89 375
43 displeasure 0.89 123
44 attendance 0.889 162
45 communication 0.889 143
46 hasty 0.889 153
47 willingly 0.889 170
48 somewhat 0.887 247
49 accordingly 0.887 247
50 corresponding 0.887 104

Hit return to continue, say "quit" to quit, or say "corpus" to get a list of the documents you just characterized:

For the corpus to be characterized:
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: authors
Authors
      MoreH  HawthorneN      ScottW     CooperJ     HayleyW   SedgwickC   HolcroftT   Anonymous
         55          33          32          22          20          20          18          14
     PrattM      ByronG    DickensC  EdgeworthM  ThackerayW     DibdinC   InchbaldM     LennoxC
         14          13          13          13          11          10          10          10
    AustenJ JerninghamM     IrvingW  RadcliffeA
          9           9           8           8
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: MoreH,fic
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all
For the reference corpus (in terms of which it will be characterized):
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: all
Date range.
Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

WORDS OVERREPRESENTED BY LOG-LIKELIHOOD
1 stanley 2807 0.497
2 stock 1283 0.529
3 religion 1136 0.674
4 worthy 717 0.505
5 religious 646 0.898
6 god 619 0.823
7 farmer 558 0.667
8 sunday 556 0.945
9 mr 556 0.669
10 good 540 0.91
11 christian 505 0.681
12 cheap 463 0.957
13 sin 460 0.828
14 brown 457 0.618
15 piety 410 0.607
16 jack 393 0.552
17 betty 378 0.605
18 not 370 0.793
19 bible 349 0.754
20 tyrrel 335 0.509
21 giles 334 0.56
22 john 332 0.448
23 jones 331 0.612
24 christianity 325 0.51
25 principle 310 0.565
26 parish 299 0.804
27 shepherd 294 0.523
28 they 285 0.744
29 hester 276 0.533
30 is 273 0.542
31 poor 270 0.851
32 little 267 0.869
33 money 244 0.729
34 because 244 0.919
35 to 230 0.798
36 daughters 224 0.396
37 things 222 0.848
38 children 220 0.813
39 church 214 0.79
40 sins 205 0.621
41 them 204 0.769
42 master 203 0.637
43 part 203 0.811
44 always 200 0.804
45 worldly 198 0.618
46 sober 196 0.644
47 much 194 0.878
48 own 194 0.783
49 learning 191 0.538
50 vanity 187 0.566

WORDS OVERREPRESENTED BY MANN-WHITNEY RHO
1 cheap 0.957 463
2 sunday 0.945 556
3 because 0.919 244
4 allowance 0.915 92.9
5 good 0.91 540
6 for 0.907 114
7 religious 0.898 646
8 got 0.895 135
9 penny 0.885 120
10 much 0.878 194
11 sold 0.875 69.3
12 little 0.869 267
13 price 0.867 147
14 bath 0.862 27.2
15 out 0.857 104
16 hazard 0.855 32.1
17 marshal 0.852 47.1
18 get 0.851 80.5
19 poor 0.851 270
20 indeed 0.85 116
21 things 0.848 222
22 people 0.839 76.6
23 keep 0.835 54.8
24 sin 0.828 460
25 god 0.823 619
26 bad 0.822 165
27 put 0.818 47.4
28 very 0.816 13
29 way 0.815 51.5
30 set 0.815 102
31 children 0.813 220
32 do 0.812 131
33 comfort 0.811 127
34 part 0.811 203
35 make 0.809 149
36 always 0.804 200
37 parish 0.804 299
38 so 0.799 93.8
39 to 0.798 230
40 churchyard 0.794 45.8
41 be 0.793 158
42 not 0.793 370
43 used 0.792 84.5
44 church 0.79 214
45 help 0.788 30.1
46 sure 0.788 34.9
47 use 0.784 95.6
48 own 0.783 194
49 per 0.783 67.2
50 moral 0.78 48.5

Hit return to continue, say "quit" to quit, or say "corpus" to get a list of the documents you just characterized:

For the corpus to be characterized:
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: CooperJ
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all
For the reference corpus (in terms of which it will be characterized):
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: all
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

WORDS OVERREPRESENTED BY LOG-LIKELIHOOD
1 spike 3653 0.59
2 of 3341 0.806
3 judith 2910 0.524
4 captain 2436 0.815
5 ludlow 2155 0.541
6 maud 2099 0.535
7 bravo 2052 0.584
8 indian 1814 0.711
9 griffith 1716 0.578
10 rifle 1625 0.688
11 brig 1593 0.587
12 nick 1587 0.5
13 boat 1453 0.74
14 scout 1391 0.635
15 vessel 1367 0.745
16 water 1324 0.8
17 manner 1224 0.972
18 canoe 1184 0.683
19 delaware 1175 0.755
20 hurry 1124 0.679
21 willoughby 1112 0.53
22 wharton 1103 0.53
23 ship 1042 0.637
24 hist 1028 0.755
25 pedlar 1021 0.516
26 his 1019 0.731
27 schooner 978 0.661
28 frances 941 0.516
29 movements 903 0.979
30 as 900 0.813
31 until 886 0.939
32 huron 882 0.603
33 tier 864 0.511
34 returned 829 0.913
35 commander 816 0.811
36 pilot 788 0.754
37 seaman 751 0.688
38 hunter 745 0.653
39 alderman 744 0.548
40 wilson 739 0.525
41 warrior 721 0.694
42 latter 711 0.955
43 ark 708 0.551
44 duncan 693 0.531
45 woods 669 0.625
46 craft 659 0.769
47 their 646 0.649
48 companion 645 0.952
49 jack 639 0.661
50 order 629 0.914

WORDS OVERREPRESENTED BY MANN-WHITNEY RHO
1 movements 0.979 903
2 manner 0.972 1224
3 movement 0.97 576
4 direction 0.961 579
5 succeeded 0.96 567
6 commenced 0.958 374
7 latter 0.955 711
8 companion 0.952 645
9 aided 0.946 221
10 manifested 0.942 252
11 until 0.939 886
12 coolly 0.935 171
13 exceeded 0.925 86.2
14 audible 0.918 175
15 unusual 0.917 149
16 readiness 0.916 211
17 intently 0.916 128
18 distance 0.915 552
19 order 0.914 629
20 quest 0.913 190
21 returned 0.913 829
22 uneasiness 0.912 139
23 females 0.912 304
24 minutes 0.905 216
25 examination 0.904 114
26 nearly 0.902 224
27 companions 0.902 268
28 necessary 0.902 303
29 speaker 0.901 118
30 dialogue 0.901 169
31 vigilance 0.899 70.8
32 arrangement 0.898 80.9
33 ordinary 0.897 178
34 interruption 0.897 103
35 opinions 0.895 203
36 congress 0.895 128
37 disappeared 0.894 137
38 preparations 0.893 93.3
39 placing 0.893 74.7
40 position 0.892 168
41 officer 0.892 442
42 owner 0.892 138
43 interrupted 0.891 332
44 maintained 0.89 128
45 customary 0.89 158
46 notwithstanding 0.888 160
47 using 0.888 68.9
48 examining 0.888 106
49 occupied 0.887 91.4
50 exception 0.887 78.3

Hit return to continue, say "quit" to quit, or say "corpus" to get a list of the documents you just characterized:

For the corpus to be characterized:
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: fic
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all
For the reference corpus (in terms of which it will be characterized):
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: poe
Date range.
Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

WORDS OVERREPRESENTED BY LOG-LIKELIHOOD
1 you 53619 0.865
2 was 39376 0.927
3 had 37091 0.95
4 said 28953 0.902
5 it 20689 0.923
6 she 18733 0.721
7 he 17422 0.802
8 have 10770 0.913
9 him 10039 0.835
10 her 9087 0.597
11 to 9056 0.832
12 been 8369 0.935
13 me 8196 0.72
14 any 8028 0.935
15 very 7978 0.881
16 do 6992 0.842
17 would 6531 0.88
18 not 6131 0.783
19 am 6085 0.85
20 don't 5729 0.781
21 could 5412 0.846
22 miss 5387 0.686
23 sir 5035 0.712
24 at 4876 0.813
25 himself 4840 0.883
26 little 4805 0.82
27 my 4764 0.574
28 were 4666 0.844
29 be 4565 0.818
30 replied 4464 0.843
31 myself 4404 0.862
32 about 4400 0.856
33 herself 4091 0.844
34 room 3968 0.873
35 that 3792 0.72
36 lady 3660 0.696
37 much 3453 0.861
38 moment 3443 0.855
39 your 3399 0.679
40 out 3273 0.792
41 nothing 3169 0.884
42 chapter 3088 0.796
43 know 3073 0.728
44 as 3022 0.804
45 however 3000 0.879
46 think 2844 0.755
47 going 2843 0.877
48 looking 2836 0.864
49 harry 2800 0.613
50 into 2728 0.884

WORDS OVERREPRESENTED BY MANN-WHITNEY RHO
1 had 0.95 37091
2 been 0.935 8369
3 any 0.935 8028
4 was 0.927 39376
5 it 0.923 20689
6 have 0.913 10770
7 conversation 0.903 2289
8 said 0.902 28953
9 continued 0.896 2134
10 really 0.896 2111
11 yourself 0.893 2234
12 possible 0.89 1713
13 taking 0.884 1321
14 after 0.884 2491
15 nothing 0.884 3169
16 into 0.884 2728
17 himself 0.883 4840
18 manner 0.882 2037
19 very 0.881 7978
20 would 0.88 6531
21 however 0.879 3000
22 person 0.878 1860
23 impossible 0.877 947
24 going 0.877 2843
25 interrupted 0.875 1045
26 room 0.873 3968
27 opportunity 0.873 987
28 immediately 0.871 1397
29 difficulty 0.87 744
30 indeed 0.87 2298
31 expected 0.869 1046
32 appearance 0.868 1274
33 put 0.867 1807
34 family 0.866 2345
35 you 0.865 53619
36 happened 0.864 692
37 apartment 0.864 1759
38 looking 0.864 2836
39 easily 0.862 633
40 myself 0.862 4404
41 much 0.861 3453
42 satisfaction 0.861 864
43 countenance 0.86 1621
44 observed 0.859 850
45 proceeded 0.859 1147
46 business 0.857 1697
47 about 0.856 4400
48 rather 0.856 1609
49 having 0.856 1964
50 determined 0.856 1390

Hit return to continue, say "quit" to quit, or say "corpus" to get a list of the documents you just characterized:

For the corpus to be characterized:
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: poe
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all
For the reference corpus (in terms of which it will be characterized):
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: fic
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

WORDS OVERREPRESENTED BY LOG-LIKELIHOOD
1 thy 64771 0.892
2 thou 24793 0.826
3 thee 22860 0.833
4 tis 13123 0.88
5 th 12851 0.773
6 earth 12348 0.76
7 canto 11456 0.585
8 their 10363 0.707
9 love 9636 0.777
10 oft 9529 0.882
11 song 9265 0.828
12 thine 9095 0.826
13 hath 8927 0.703
14 heaven 8656 0.792
15 sweet 8352 0.799
16 each 7804 0.853
17 ye 7772 0.797
18 nor 7690 0.846
19 twas 7335 0.857
20 bright 7334 0.808
21 fair 7249 0.809
22 berkeley 6826 0.5
23 its 6516 0.653
24 god 6416 0.634
25 light 6276 0.786
26 soul 6170 0.774
27 where 6131 0.819
28 through 5962 0.771
29 ii 5803 0.679
30 fame 5715 0.823
31 esq 5620 0.621
32 notes 5502 0.706
33 sun 5483 0.788
34 from 5379 0.763
35 art 5345 0.819
36 thus 5125 0.749
37 muse 4988 0.753
38 breast 4920 0.802
39 death 4818 0.726
40 beneath 4721 0.734
41 glory 4622 0.772
42 skies 4593 0.809
43 lo 4416 0.763
44 eye 4384 0.754
45 whose 4313 0.759
46 yon 4304 0.767
47 sky 4277 0.778
48 joy 4243 0.772
49 morn 4225 0.796
50 iv 4180 0.646

WORDS OVERREPRESENTED BY MANN-WHITNEY RHO
1 thy 0.892 64771
2 oft 0.882 9529
3 tis 0.88 13123
4 twas 0.857 7335
5 each 0.853 7804
6 nor 0.846 7690
7 thee 0.833 22860
8 song 0.828 9265
9 thou 0.826 24793
10 thine 0.826 9095
11 fame 0.823 5715
12 art 0.819 5345
13 where 0.819 6131
14 fair 0.809 7249
15 skies 0.809 4593
16 bright 0.808 7334
17 breast 0.802 4920
18 high 0.8 3875
19 sweet 0.799 8352
20 ye 0.797 7772
21 morn 0.796 4225
22 still 0.794 3624
23 heaven 0.792 8656
24 sun 0.788 5483
25 strife 0.787 3038
26 light 0.786 6276
27 rise 0.783 3517
28 sky 0.778 4277
29 ere 0.777 3434
30 love 0.777 9636
31 blessed 0.776 3706
32 soul 0.774 6170
33 th 0.773 12851
34 joy 0.772 4243
35 vain 0.772 3847
36 throne 0.772 3196
37 glory 0.772 4622
38 shine 0.771 2541
39 through 0.771 5962
40 woe 0.768 4179
41 yon 0.767 4304
42 shade 0.766 3669
43 soft 0.764 3706
44 from 0.763 5379
45 lo 0.763 4416
46 yet 0.763 3434
47 vale 0.761 2127
48 earth 0.76 12348
49 whose 0.759 4313
50 wave 0.756 4106

Hit return to continue, say "quit" to quit, or say "corpus" to get a list of the documents you just characterized:
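
The two headings in the output above name standard statistics, so it may help to see how such scores can be computed. The following is a minimal sketch in Python, not the code in NassrProgram.R, whose source does not appear in this transcript. It assumes that the log-likelihood column is a signed two-term Dunning G² computed from pooled word counts, and that rho is the Mann-Whitney U statistic divided by the number of document pairs (the share of pairs in which a document from the characterized corpus uses the word more often than a document from the reference corpus, with ties counted as half). The function and parameter names are hypothetical. Reading each row as word, ranking score, then the other score is also an inference, based on values such as `know 523 0.916` in one list matching `know 0.916 523` in the other.

```python
import math


def dunning_g2(count_a, count_b, total_a, total_b):
    """Signed two-term Dunning log-likelihood (G^2) for one word.

    count_a, count_b: occurrences of the word in the characterized and
    reference corpora; total_a, total_b: total word counts of each corpus.
    Positive values mean the word is overrepresented in corpus A.
    This is an assumed formulation, not taken from NassrProgram.R.
    """
    pooled_rate = (count_a + count_b) / (total_a + total_b)
    expected_a = total_a * pooled_rate
    expected_b = total_b * pooled_rate
    g2 = 0.0
    # A zero count contributes nothing (the limit of x * log(x) as x -> 0).
    if count_a > 0:
        g2 += count_a * math.log(count_a / expected_a)
    if count_b > 0:
        g2 += count_b * math.log(count_b / expected_b)
    g2 *= 2.0
    # Sign the score by the direction of the difference in relative frequency.
    return g2 if count_a / total_a >= count_b / total_b else -g2


def mann_whitney_rho(freqs_a, freqs_b):
    """U / (n_a * n_b) for one word.

    freqs_a, freqs_b: the word's relative frequency in each document of the
    characterized and reference corpora. Returns the share of (a, b) document
    pairs where a > b, counting ties as half; 0.5 means no difference.
    """
    wins = 0.0
    for a in freqs_a:
        for b in freqs_b:
            if a > b:
                wins += 1.0
            elif a == b:
                wins += 0.5
    return wins / (len(freqs_a) * len(freqs_b))


if __name__ == "__main__":
    # Toy numbers, invented purely for illustration.
    print(dunning_g2(20, 10, 100, 100))
    print(mann_whitney_rho([0.003, 0.004, 0.005], [0.001, 0.002]))
```

The contrast between the two lists follows from these definitions: a pooled-count statistic can be dominated by a word that is very frequent in a few volumes (character names such as `harry` or `spike` score high on log-likelihood but have rho near 0.5 to 0.7), while a rank-based statistic rewards words used consistently more often across many documents.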