R version 2.15.2 (2012-10-26) -- "Trick or Treat"
Copyright (C) 2012 The R Foundation for Statistical Computing
ISBN 3-900051-07-0
Platform: x86_64-apple-darwin9.8.0/x86_64 (64-bit)

R is free software and comes with ABSOLUTELY NO WARRANTY.
You are welcome to redistribute it under certain conditions.
Type 'license()' or 'licence()' for distribution details.

  Natural language support but running in an English locale

R is a collaborative project with many contributors.
Type 'contributors()' for more information and
'citation()' on how to cite R or R packages in publications.

Type 'demo()' for some demos, 'help()' for on-line help, or
'help.start()' for an HTML browser interface to help.
Type 'q()' to quit R.

[R.app GUI 1.53 (6335) x86_64-apple-darwin9.8.0]

[Workspace restored from /Users/Erik/.RData]
[History restored from /Users/Erik/.Rapp.history]

> source("/Users/Erik/Downloads/Ted_Underwood_Files_for_R/NassrProgram.R")
You'll need to find the necessary data files on your own computer.
Ready to select NassrMetadata? yes

Ready to select NassrData? 

Unpacking the data. This may take five to ten minutes.Here are a few of the most common authors in this dataset.Authors
      MoreH  HawthorneN      ScottW     CooperJ     HayleyW   SedgwickC   HolcroftT   Anonymous 
         55          33          32          22          20          20          18          14 
     PrattM      ByronG    DickensC  EdgeworthM  ThackerayW     DibdinC   InchbaldM     LennoxC 
         14          13          13          13          11          10          10          10 
    AustenJ JerninghamM     IrvingW  RadcliffeA 
          9           9           8           8 
When prompted for an author name you can say "authors" to repeat that list.For the corpus to be characterized:
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: EdgeworthM
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

For the reference corpus (in terms of which it will be characterized):
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: all
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

 WORDS OVERREPRESENTED BY LOG-LIKELIHOOD 
1	harry          	10098	0.693	
2	lucy           	7956	0.676	
3	lady           	5269	0.837	
4	ormond         	5081	0.598	
5	percy          	4819	0.49	
6	belinda        	4383	0.523	
7	hervey         	3141	0.527	
8	falconer       	2813	0.519	
9	said           	2783	0.852	
10	leonora        	2591	0.634	
11	you            	2467	0.868	
12	caroline       	2427	0.552	
13	clarence       	2349	0.519	
14	lord           	2033	0.496	
15	helen          	1930	0.555	
16	portman        	1834	0.522	
17	rosamond       	1792	0.52	
18	rupert         	1641	0.638	
19	ladyship       	1637	0.824	
20	alfred         	1458	0.53	
21	commissioner   	1451	0.508	
22	count          	1367	0.577	
23	she            	1099	0.79	
24	percival       	1016	0.524	
25	vincent        	988	0.548	
26	cecilia        	883	0.583	
27	thing          	811	0.835	
28	not            	802	0.853	
29	dear           	792	0.88	
30	could          	754	0.913	
31	virginia       	732	0.492	
32	sir            	715	0.7	
33	godfrey        	677	0.513	
34	mamma          	668	0.735	
35	ireland        	666	0.691	
36	any            	632	0.85	
37	miss           	627	0.767	
38	olivia         	612	0.566	
39	lordship       	606	0.6	
40	do             	586	0.884	
41	steam          	559	0.584	
42	it             	531	0.807	
43	know           	523	0.916	
44	that           	514	0.835	
45	am             	491	0.82	
46	whilst         	419	0.744	
47	barclay        	406	0.517	
48	pump           	388	0.602	
49	all            	387	0.77	
50	be             	383	0.824	

 WORDS OVERREPRESENTED BY MANN-WHITNEY RHO 
1	understand     	0.937	271	
2	recollect      	0.923	309	
3	talking        	0.916	127	
4	know           	0.916	523	
5	could          	0.913	754	
6	provoking      	0.912	41.9	
7	nonsense       	0.911	62.3	
8	perfectly      	0.905	119	
9	explain        	0.903	192	
10	continually    	0.889	95.4	
11	tired          	0.888	76	
12	going          	0.888	205	
13	do             	0.884	586	
14	dear           	0.88	792	
15	sorry          	0.879	79.5	
16	satisfied      	0.879	93.8	
17	yesterday      	0.879	48.9	
18	liked          	0.875	48.1	
19	spoiled        	0.874	19.6	
20	directly       	0.869	77.2	
21	quite          	0.869	136	
22	please         	0.868	182	
23	you            	0.868	2467	
24	repeated       	0.868	233	
25	decide         	0.866	101	
26	afraid         	0.864	148	
27	repeating      	0.862	52.7	
28	thank          	0.862	115	
29	manage         	0.86	44	
30	guess          	0.86	97.8	
31	sure           	0.859	290	
32	ashamed        	0.857	35.4	
33	put            	0.856	140	
34	admiration     	0.855	90.5	
35	disappointed   	0.855	44.8	
36	surprised      	0.855	75.6	
37	tiresome       	0.853	37.2	
38	especially     	0.853	76.3	
39	not            	0.853	802	
40	reading        	0.853	80.1	
41	dressing       	0.852	9.04	
42	said           	0.852	2783	
43	formerly       	0.851	50	
44	understanding  	0.851	103	
45	possible       	0.85	157	
46	because        	0.85	261	
47	really         	0.85	125	
48	any            	0.85	632	
49	saw            	0.85	183	
50	think          	0.85	173	

Hit return to continue, say "quit" to quit, or say "corpus" to get a
list of the documents you just characterized: 

For the corpus to be characterized:
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: LennoxC
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

For the reference corpus (in terms of which it will be characterized):
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: all
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

 WORDS OVERREPRESENTED BY LOG-LIKELIHOOD 
1	mr             	2961	0.943	
2	my             	2019	0.743	
3	howard         	1837	0.716	
4	harley         	1787	0.629	
5	greville       	1495	0.641	
6	mrs            	1483	0.932	
7	neville        	945	0.639	
8	fanny          	894	0.815	
9	me             	870	0.799	
10	aubrey         	821	0.591	
11	sophia         	663	0.725	
12	lady           	564	0.88	
13	wilmot         	556	0.831	
14	extremely      	548	0.984	
15	her            	534	0.826	
16	seymour        	510	0.679	
17	uncle          	503	0.767	
18	hero           	444	0.523	
19	wholly         	421	0.961	
20	behaviour      	419	0.954	
21	our            	419	0.794	
22	this           	367	0.904	
23	which          	356	0.749	
24	amiable        	353	0.959	
25	to             	326	0.86	
26	letter         	312	0.916	
27	madam          	306	0.923	
28	benson         	271	0.636	
29	myself         	263	0.827	
30	situation      	260	0.924	
31	conversation   	258	0.957	
32	george         	258	0.492	
33	us             	255	0.734	
34	maria          	235	0.62	
35	informed       	232	0.958	
36	miss           	224	0.893	
37	regard         	212	0.812	
38	instantly      	209	0.927	
39	company        	196	0.896	
40	conduct        	195	0.929	
41	satisfaction   	187	0.967	
42	family         	185	0.895	
43	bradshaw       	181	0.69	
44	louisa         	178	0.76	
45	however        	176	0.81	
46	she            	174	0.814	
47	uneasiness     	171	0.889	
48	dear           	166	0.779	
49	so             	155	0.752	
50	convinced      	144	0.837	

 WORDS OVERREPRESENTED BY MANN-WHITNEY RHO 
1	extremely      	0.984	548	
2	civility       	0.97	117	
3	apprehensive   	0.97	96.4	
4	satisfaction   	0.967	187	
5	warmest        	0.961	80.3	
6	wholly         	0.961	421	
7	amiable        	0.959	353	
8	accompany      	0.959	55.8	
9	informed       	0.958	232	
10	relating       	0.957	83.3	
11	conversation   	0.957	258	
12	behaviour      	0.954	419	
13	mortified      	0.949	34.6	
14	mortification  	0.948	113	
15	received       	0.945	119	
16	mr             	0.943	2961	
17	opportunity    	0.942	76.5	
18	amusements     	0.939	32.3	
19	entreaties     	0.937	54.9	
20	apprehensions  	0.937	89.4	
21	attentions     	0.936	70.9	
22	consequences   	0.933	44.2	
23	degree         	0.933	99.7	
24	mrs            	0.932	1483	
25	scheme         	0.932	55	
26	imagine        	0.93	114	
27	conduct        	0.929	195	
28	insisted       	0.928	80.6	
29	instantly      	0.927	209	
30	countenance    	0.925	123	
31	situation      	0.924	260	
32	madam          	0.923	306	
33	visit          	0.923	107	
34	ladies         	0.922	114	
35	arrival        	0.922	83.5	
36	acknowledged   	0.92	53	
37	reception      	0.92	46.8	
38	circumstance   	0.919	98.7	
39	emotions       	0.918	50.8	
40	concluded      	0.917	115	
41	relations      	0.917	84.3	
42	letter         	0.916	312	
43	politeness     	0.916	110	
44	shocked        	0.914	89.2	
45	accident       	0.913	74.1	
46	inform         	0.913	74.8	
47	acquaintance   	0.912	131	
48	afforded       	0.912	50.2	
49	sentiments     	0.91	83.9	
50	ordered        	0.91	66.6	

Hit return to continue, say "quit" to quit, or say "corpus" to get a
list of the documents you just characterized: 

For the corpus to be characterized:
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: ScottW,fic
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

For the reference corpus (in terms of which it will be characterized):
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: all
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

 WORDS OVERREPRESENTED BY LOG-LIKELIHOOD 
1	which          	4162	0.913	
2	said           	4137	0.927	
3	wad            	4100	0.691	
4	abbot          	3697	0.75	
5	woodstock      	3620	0.571	
6	ye             	3003	0.697	
7	hae            	2589	0.748	
8	of             	2555	0.735	
9	answered       	2519	0.958	
10	everard        	1858	0.577	
11	knight         	1698	0.859	
12	morton         	1631	0.584	
13	weel           	1605	0.748	
14	roland         	1601	0.604	
15	thou           	1569	0.663	
16	wi             	1553	0.78	
17	betwixt        	1528	0.958	
18	king           	1413	0.78	
19	sae            	1348	0.768	
20	master         	1346	0.823	
21	scotland       	1335	0.942	
22	your           	1308	0.818	
23	castle         	1258	0.862	
24	ay             	1182	0.929	
25	ken            	1148	0.794	
26	ashton         	1129	0.513	
27	bertram        	1114	0.53	
28	nae            	1114	0.727	
29	duke           	1108	0.756	
30	folk           	1096	0.797	
31	laird          	1074	0.851	
32	louis          	1037	0.644	
33	scottish       	1035	0.947	
34	lovel          	1017	0.512	
35	apartment      	1001	0.939	
36	highland       	952	0.789	
37	glover         	945	0.598	
38	catharine      	929	0.541	
39	leicester      	915	0.553	
40	ain            	890	0.75	
41	mair           	886	0.792	
42	earl           	878	0.837	
43	mr             	866	0.645	
44	bailie         	860	0.778	
45	mac            	845	0.656	
46	ane            	836	0.781	
47	alan           	831	0.511	
48	mowbray        	823	0.526	
49	tyrrel         	822	0.537	
50	antiquary      	790	0.775	

 WORDS OVERREPRESENTED BY MANN-WHITNEY RHO 
1	answered       	0.958	2519	
2	betwixt        	0.958	1528	
3	scottish       	0.947	1035	
4	warrant        	0.944	501	
5	purpose        	0.944	725	
6	scotland       	0.942	1335	
7	apartment      	0.939	1001	
8	risk           	0.93	263	
9	ay             	0.929	1182	
10	said           	0.927	4137	
11	excepting      	0.927	345	
12	termed         	0.918	161	
13	permit         	0.914	247	
14	trusty         	0.913	169	
15	which          	0.913	4162	
16	farther        	0.909	572	
17	wench          	0.907	212	
18	ordinary       	0.907	223	
19	weapon         	0.905	235	
20	ale            	0.903	222	
21	formidable     	0.903	147	
22	boot           	0.902	127	
23	followers      	0.898	505	
24	personal       	0.897	185	
25	safety         	0.897	365	
26	partly         	0.897	239	
27	domestics      	0.897	122	
28	desirous       	0.896	215	
29	edinburgh      	0.895	587	
30	commanded      	0.895	222	
31	hastily        	0.894	233	
32	courtesy       	0.894	262	
33	quarrel        	0.893	183	
34	kinsman        	0.892	432	
35	assistance     	0.892	248	
36	otherwise      	0.892	150	
37	saddle         	0.891	109	
38	authority      	0.891	253	
39	trow           	0.89	158	
40	assuming       	0.89	53.6	
41	assumed        	0.89	124	
42	tone           	0.89	375	
43	displeasure    	0.89	123	
44	attendance     	0.889	162	
45	communication  	0.889	143	
46	hasty          	0.889	153	
47	willingly      	0.889	170	
48	somewhat       	0.887	247	
49	accordingly    	0.887	247	
50	corresponding  	0.887	104	

Hit return to continue, say "quit" to quit, or say "corpus" to get a
list of the documents you just characterized: 

For the corpus to be characterized:
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: authors
Authors
      MoreH  HawthorneN      ScottW     CooperJ     HayleyW   SedgwickC   HolcroftT   Anonymous 
         55          33          32          22          20          20          18          14 
     PrattM      ByronG    DickensC  EdgeworthM  ThackerayW     DibdinC   InchbaldM     LennoxC 
         14          13          13          13          11          10          10          10 
    AustenJ JerninghamM     IrvingW  RadcliffeA 
          9           9           8           8 
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: MoreH,fic
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

For the reference corpus (in terms of which it will be characterized):
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: all
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

 WORDS OVERREPRESENTED BY LOG-LIKELIHOOD 
1	stanley        	2807	0.497	
2	stock          	1283	0.529	
3	religion       	1136	0.674	
4	worthy         	717	0.505	
5	religious      	646	0.898	
6	god            	619	0.823	
7	farmer         	558	0.667	
8	sunday         	556	0.945	
9	mr             	556	0.669	
10	good           	540	0.91	
11	christian      	505	0.681	
12	cheap          	463	0.957	
13	sin            	460	0.828	
14	brown          	457	0.618	
15	piety          	410	0.607	
16	jack           	393	0.552	
17	betty          	378	0.605	
18	not            	370	0.793	
19	bible          	349	0.754	
20	tyrrel         	335	0.509	
21	giles          	334	0.56	
22	john           	332	0.448	
23	jones          	331	0.612	
24	christianity   	325	0.51	
25	principle      	310	0.565	
26	parish         	299	0.804	
27	shepherd       	294	0.523	
28	they           	285	0.744	
29	hester         	276	0.533	
30	is             	273	0.542	
31	poor           	270	0.851	
32	little         	267	0.869	
33	money          	244	0.729	
34	because        	244	0.919	
35	to             	230	0.798	
36	daughters      	224	0.396	
37	things         	222	0.848	
38	children       	220	0.813	
39	church         	214	0.79	
40	sins           	205	0.621	
41	them           	204	0.769	
42	master         	203	0.637	
43	part           	203	0.811	
44	always         	200	0.804	
45	worldly        	198	0.618	
46	sober          	196	0.644	
47	much           	194	0.878	
48	own            	194	0.783	
49	learning       	191	0.538	
50	vanity         	187	0.566	

 WORDS OVERREPRESENTED BY MANN-WHITNEY RHO 
1	cheap          	0.957	463	
2	sunday         	0.945	556	
3	because        	0.919	244	
4	allowance      	0.915	92.9	
5	good           	0.91	540	
6	for            	0.907	114	
7	religious      	0.898	646	
8	got            	0.895	135	
9	penny          	0.885	120	
10	much           	0.878	194	
11	sold           	0.875	69.3	
12	little         	0.869	267	
13	price          	0.867	147	
14	bath           	0.862	27.2	
15	out            	0.857	104	
16	hazard         	0.855	32.1	
17	marshal        	0.852	47.1	
18	get            	0.851	80.5	
19	poor           	0.851	270	
20	indeed         	0.85	116	
21	things         	0.848	222	
22	people         	0.839	76.6	
23	keep           	0.835	54.8	
24	sin            	0.828	460	
25	god            	0.823	619	
26	bad            	0.822	165	
27	put            	0.818	47.4	
28	very           	0.816	13	
29	way            	0.815	51.5	
30	set            	0.815	102	
31	children       	0.813	220	
32	do             	0.812	131	
33	comfort        	0.811	127	
34	part           	0.811	203	
35	make           	0.809	149	
36	always         	0.804	200	
37	parish         	0.804	299	
38	so             	0.799	93.8	
39	to             	0.798	230	
40	churchyard     	0.794	45.8	
41	be             	0.793	158	
42	not            	0.793	370	
43	used           	0.792	84.5	
44	church         	0.79	214	
45	help           	0.788	30.1	
46	sure           	0.788	34.9	
47	use            	0.784	95.6	
48	own            	0.783	194	
49	per            	0.783	67.2	
50	moral          	0.78	48.5	

Hit return to continue, say "quit" to quit, or say "corpus" to get a
list of the documents you just characterized: 

For the corpus to be characterized:
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: CooperJ
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

For the reference corpus (in terms of which it will be characterized):
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: all
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

 WORDS OVERREPRESENTED BY LOG-LIKELIHOOD 
1	spike          	3653	0.59	
2	of             	3341	0.806	
3	judith         	2910	0.524	
4	captain        	2436	0.815	
5	ludlow         	2155	0.541	
6	maud           	2099	0.535	
7	bravo          	2052	0.584	
8	indian         	1814	0.711	
9	griffith       	1716	0.578	
10	rifle          	1625	0.688	
11	brig           	1593	0.587	
12	nick           	1587	0.5	
13	boat           	1453	0.74	
14	scout          	1391	0.635	
15	vessel         	1367	0.745	
16	water          	1324	0.8	
17	manner         	1224	0.972	
18	canoe          	1184	0.683	
19	delaware       	1175	0.755	
20	hurry          	1124	0.679	
21	willoughby     	1112	0.53	
22	wharton        	1103	0.53	
23	ship           	1042	0.637	
24	hist           	1028	0.755	
25	pedlar         	1021	0.516	
26	his            	1019	0.731	
27	schooner       	978	0.661	
28	frances        	941	0.516	
29	movements      	903	0.979	
30	as             	900	0.813	
31	until          	886	0.939	
32	huron          	882	0.603	
33	tier           	864	0.511	
34	returned       	829	0.913	
35	commander      	816	0.811	
36	pilot          	788	0.754	
37	seaman         	751	0.688	
38	hunter         	745	0.653	
39	alderman       	744	0.548	
40	wilson         	739	0.525	
41	warrior        	721	0.694	
42	latter         	711	0.955	
43	ark            	708	0.551	
44	duncan         	693	0.531	
45	woods          	669	0.625	
46	craft          	659	0.769	
47	their          	646	0.649	
48	companion      	645	0.952	
49	jack           	639	0.661	
50	order          	629	0.914	

 WORDS OVERREPRESENTED BY MANN-WHITNEY RHO 
1	movements      	0.979	903	
2	manner         	0.972	1224	
3	movement       	0.97	576	
4	direction      	0.961	579	
5	succeeded      	0.96	567	
6	commenced      	0.958	374	
7	latter         	0.955	711	
8	companion      	0.952	645	
9	aided          	0.946	221	
10	manifested     	0.942	252	
11	until          	0.939	886	
12	coolly         	0.935	171	
13	exceeded       	0.925	86.2	
14	audible        	0.918	175	
15	unusual        	0.917	149	
16	readiness      	0.916	211	
17	intently       	0.916	128	
18	distance       	0.915	552	
19	order          	0.914	629	
20	quest          	0.913	190	
21	returned       	0.913	829	
22	uneasiness     	0.912	139	
23	females        	0.912	304	
24	minutes        	0.905	216	
25	examination    	0.904	114	
26	nearly         	0.902	224	
27	companions     	0.902	268	
28	necessary      	0.902	303	
29	speaker        	0.901	118	
30	dialogue       	0.901	169	
31	vigilance      	0.899	70.8	
32	arrangement    	0.898	80.9	
33	ordinary       	0.897	178	
34	interruption   	0.897	103	
35	opinions       	0.895	203	
36	congress       	0.895	128	
37	disappeared    	0.894	137	
38	preparations   	0.893	93.3	
39	placing        	0.893	74.7	
40	position       	0.892	168	
41	officer        	0.892	442	
42	owner          	0.892	138	
43	interrupted    	0.891	332	
44	maintained     	0.89	128	
45	customary      	0.89	158	
46	notwithstanding	0.888	160	
47	using          	0.888	68.9	
48	examining      	0.888	106	
49	occupied       	0.887	91.4	
50	exception      	0.887	78.3	

Hit return to continue, say "quit" to quit, or say "corpus" to get a
list of the documents you just characterized: 

For the corpus to be characterized:
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: fic
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

For the reference corpus (in terms of which it will be characterized):
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: poe
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

 WORDS OVERREPRESENTED BY LOG-LIKELIHOOD 
1	you            	53619	0.865	
2	was            	39376	0.927	
3	had            	37091	0.95	
4	said           	28953	0.902	
5	it             	20689	0.923	
6	she            	18733	0.721	
7	he             	17422	0.802	
8	have           	10770	0.913	
9	him            	10039	0.835	
10	her            	9087	0.597	
11	to             	9056	0.832	
12	been           	8369	0.935	
13	me             	8196	0.72	
14	any            	8028	0.935	
15	very           	7978	0.881	
16	do             	6992	0.842	
17	would          	6531	0.88	
18	not            	6131	0.783	
19	am             	6085	0.85	
20	don't          	5729	0.781	
21	could          	5412	0.846	
22	miss           	5387	0.686	
23	sir            	5035	0.712	
24	at             	4876	0.813	
25	himself        	4840	0.883	
26	little         	4805	0.82	
27	my             	4764	0.574	
28	were           	4666	0.844	
29	be             	4565	0.818	
30	replied        	4464	0.843	
31	myself         	4404	0.862	
32	about          	4400	0.856	
33	herself        	4091	0.844	
34	room           	3968	0.873	
35	that           	3792	0.72	
36	lady           	3660	0.696	
37	much           	3453	0.861	
38	moment         	3443	0.855	
39	your           	3399	0.679	
40	out            	3273	0.792	
41	nothing        	3169	0.884	
42	chapter        	3088	0.796	
43	know           	3073	0.728	
44	as             	3022	0.804	
45	however        	3000	0.879	
46	think          	2844	0.755	
47	going          	2843	0.877	
48	looking        	2836	0.864	
49	harry          	2800	0.613	
50	into           	2728	0.884	

 WORDS OVERREPRESENTED BY MANN-WHITNEY RHO 
1	had            	0.95	37091	
2	been           	0.935	8369	
3	any            	0.935	8028	
4	was            	0.927	39376	
5	it             	0.923	20689	
6	have           	0.913	10770	
7	conversation   	0.903	2289	
8	said           	0.902	28953	
9	continued      	0.896	2134	
10	really         	0.896	2111	
11	yourself       	0.893	2234	
12	possible       	0.89	1713	
13	taking         	0.884	1321	
14	after          	0.884	2491	
15	nothing        	0.884	3169	
16	into           	0.884	2728	
17	himself        	0.883	4840	
18	manner         	0.882	2037	
19	very           	0.881	7978	
20	would          	0.88	6531	
21	however        	0.879	3000	
22	person         	0.878	1860	
23	impossible     	0.877	947	
24	going          	0.877	2843	
25	interrupted    	0.875	1045	
26	room           	0.873	3968	
27	opportunity    	0.873	987	
28	immediately    	0.871	1397	
29	difficulty     	0.87	744	
30	indeed         	0.87	2298	
31	expected       	0.869	1046	
32	appearance     	0.868	1274	
33	put            	0.867	1807	
34	family         	0.866	2345	
35	you            	0.865	53619	
36	happened       	0.864	692	
37	apartment      	0.864	1759	
38	looking        	0.864	2836	
39	easily         	0.862	633	
40	myself         	0.862	4404	
41	much           	0.861	3453	
42	satisfaction   	0.861	864	
43	countenance    	0.86	1621	
44	observed       	0.859	850	
45	proceeded      	0.859	1147	
46	business       	0.857	1697	
47	about          	0.856	4400	
48	rather         	0.856	1609	
49	having         	0.856	1964	
50	determined     	0.856	1390	

Hit return to continue, say "quit" to quit, or say "corpus" to get a
list of the documents you just characterized: 

For the corpus to be characterized:
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: poe
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

For the reference corpus (in terms of which it will be characterized):
An author, like ByronG, or a genre (fic/poe/all), or author-comma-genre to get the intersection: fic
Date range. Either "all" or two dates between 1780 and 1859 separated by a hyphen: all

 WORDS OVERREPRESENTED BY LOG-LIKELIHOOD 
1	thy            	64771	0.892	
2	thou           	24793	0.826	
3	thee           	22860	0.833	
4	tis            	13123	0.88	
5	th             	12851	0.773	
6	earth          	12348	0.76	
7	canto          	11456	0.585	
8	their          	10363	0.707	
9	love           	9636	0.777	
10	oft            	9529	0.882	
11	song           	9265	0.828	
12	thine          	9095	0.826	
13	hath           	8927	0.703	
14	heaven         	8656	0.792	
15	sweet          	8352	0.799	
16	each           	7804	0.853	
17	ye             	7772	0.797	
18	nor            	7690	0.846	
19	twas           	7335	0.857	
20	bright         	7334	0.808	
21	fair           	7249	0.809	
22	berkeley       	6826	0.5	
23	its            	6516	0.653	
24	god            	6416	0.634	
25	light          	6276	0.786	
26	soul           	6170	0.774	
27	where          	6131	0.819	
28	through        	5962	0.771	
29	ii             	5803	0.679	
30	fame           	5715	0.823	
31	esq            	5620	0.621	
32	notes          	5502	0.706	
33	sun            	5483	0.788	
34	from           	5379	0.763	
35	art            	5345	0.819	
36	thus           	5125	0.749	
37	muse           	4988	0.753	
38	breast         	4920	0.802	
39	death          	4818	0.726	
40	beneath        	4721	0.734	
41	glory          	4622	0.772	
42	skies          	4593	0.809	
43	lo             	4416	0.763	
44	eye            	4384	0.754	
45	whose          	4313	0.759	
46	yon            	4304	0.767	
47	sky            	4277	0.778	
48	joy            	4243	0.772	
49	morn           	4225	0.796	
50	iv             	4180	0.646	

 WORDS OVERREPRESENTED BY MANN-WHITNEY RHO 
1	thy            	0.892	64771	
2	oft            	0.882	9529	
3	tis            	0.88	13123	
4	twas           	0.857	7335	
5	each           	0.853	7804	
6	nor            	0.846	7690	
7	thee           	0.833	22860	
8	song           	0.828	9265	
9	thou           	0.826	24793	
10	thine          	0.826	9095	
11	fame           	0.823	5715	
12	art            	0.819	5345	
13	where          	0.819	6131	
14	fair           	0.809	7249	
15	skies          	0.809	4593	
16	bright         	0.808	7334	
17	breast         	0.802	4920	
18	high           	0.8	3875	
19	sweet          	0.799	8352	
20	ye             	0.797	7772	
21	morn           	0.796	4225	
22	still          	0.794	3624	
23	heaven         	0.792	8656	
24	sun            	0.788	5483	
25	strife         	0.787	3038	
26	light          	0.786	6276	
27	rise           	0.783	3517	
28	sky            	0.778	4277	
29	ere            	0.777	3434	
30	love           	0.777	9636	
31	blessed        	0.776	3706	
32	soul           	0.774	6170	
33	th             	0.773	12851	
34	joy            	0.772	4243	
35	vain           	0.772	3847	
36	throne         	0.772	3196	
37	glory          	0.772	4622	
38	shine          	0.771	2541	
39	through        	0.771	5962	
40	woe            	0.768	4179	
41	yon            	0.767	4304	
42	shade          	0.766	3669	
43	soft           	0.764	3706	
44	from           	0.763	5379	
45	lo             	0.763	4416	
46	yet            	0.763	3434	
47	vale           	0.761	2127	
48	earth          	0.76	12348	
49	whose          	0.759	4313	
50	wave           	0.756	4106	

Hit return to continue, say "quit" to quit, or say "corpus" to get a
list of the documents you just characterized: