INTUIA/Programa final/spacy/tests/training/__pycache__/test_training.cpython-312.pyc

399 lines
56 KiB

2026-03-15 13:27:50 +00:00
Ë
?û gy¶ãóÊddlZddlZddlZddlZddlmZmZddlZddlm Z ddl
m Z m Z ddl
mZmZmZmZmZmZmZmZddlmZddlmZddlmZdd lmZdd
lmZm Z m!Z!m"Z"d d l#m$Z$ejJd
«Z&ejJ«d«Z'ejJd«Z(ejRjUd«d«Z+ejRjUd«d«Z,dZ-ejRjUd«d«Z.dZ/dZ0dZ1dZ2dZ3dZ4dZ5dZ6dZ7ejRjqd «d!„«Z9d"„Z:d#„Z;d$„Z<d%„Z=d&„Z>d'„Z?d(„Z@d)„ZAd*„ZBd+„ZCd,„ZDd-„ZEd.„ZFd/„ZGd0„ZHd1„ZId2„ZJejRj—d3gd4¢d5d6gdgdgd7ggdd7gd ggffgd8¢d9d6gdgdgdgd7gggd:¢d;ggffd<d=gd5d6gdgdd7ggdd7gd7ggffgd>¢gd?¢dd7gd gd ggdgdgd7d ggffgd?¢gd@¢dgd7gd d;ggdgd7gd gd ggffdAd<gd<ggdggd7ggffgdB¢gdC¢dgdd7gd7gd ggdd7gd7d gd;ggffg«dD„«ZLdE„ZMdF„ZNdG„ZOdH„ZPdI„ZQdJ„ZRdK„ZSdL„ZTdM„ZUdN„ZVdO„ZWdP„ZXdQ„ZYdR„ZZdS„Z[dT„Z\y)UéN)ÚAdamÚ compounding)ÚEnglish)ÚDocÚDocBin)Ú AlignmentÚCorpusÚExampleÚbiluo_tags_to_offsetsÚbiluo_tags_to_spansÚ docs_to_jsonÚ iob_to_biluoÚoffsets_to_biluo_tags©Úget_alignments)ÚAlignmentArray)Ú json_to_docs)Útrain_while_improving)Úget_words_and_spacesÚload_config_from_strÚload_model_from_pathÚ minibatché)Ú make_tempdirc óöt«}gd¢}gd¢}gd¢}gd¢}gd¢}gd¢}gd¢}dgt|«z}d |d
<d |d <d
|d<d|d<d|d<dddœ} t|j||||||||¬« }
| |
_|
S)
ÚSarahú'sÚsisterÚflewÚtoÚSiliconÚValleyÚviaÚLondonú.)
ÚNNPÚPOSÚNNÚVBDÚINr'r'r+r'r&)
ÚPROPNÚPARTÚNOUNÚVERBÚADPr,r,r0r,ÚPUNCT)
úNounType=prop|Number=singzPoss=yesz Number=singzTense=past|VerbForm=finÚr2r2r3r2zPunctType=peri)
rrér4r4éér4éé)
ÚpossÚcaseÚnsubjÚROOTÚprepÚcompoundÚpobjr=r?Úpunct)
rrrÚflyr!r"r#r$r%r&ÚB-PERSONrzI-PERSONéúB-LOCr8úI-LOCr5zB-GPEéçð?ç)ÚTRAVELÚBAKING)ÚwordsÚtagsÚposÚmorphsÚheadsÚdepsÚlemmasÚents)rÚlenrÚvocabÚcats) ÚnlprLrMrNrOrPrQrRrSrVÚdocs úcC:\Users\garci\AppData\Roaming\Python\Python312\site-packages\spacy/tests/training/test_training.pyrXrX#ä
)€Câ ^€EÚ L€DÚ
]€Cò=€Fò
+€EÚ a€DÚ
^€FØ ˆ5”3u“:Ñ €DØ€Dˆ€Dˆ€Dˆ€Dˆ€Dˆ SÑ )€Dä
Ø ‰ ØØ
Ø ØØØ
ØØ
ô
€Cð€C„HØ €Jócó&gd¢gd¢gd¢gd¢gd¢dœS)N)rDrr4r6r8r5r7)ÚHiÚthereÚeveryoneÚItÚisÚjustÚme)TTTTTTF)ÚINTJÚADVÚPRONreÚAUXrdre)rDrrrDrrr)ÚidsrLÚspacesrMÚ sent_starts©rjrZrYÚ merged_dictrkHsò  ðrZcó.t«}|jS©N)rrU)rWs rYrUrUSsä
)€CØ 9‰9ÐrZc
ó&dggdggdggdggdggdggddggd d
ggd d ggg }t«}|jd
«}|D]!\}}|D]\}}}|j|«ŒŒ#|j«t d«D]Z}t j |«|D]>\} }
tj|j| «d|
i«} |j| g«Œ@Œ\t«5} |j| «t| «}
ddd«|D]s\} }

| «}|jDcic]%}|j|j f|j"Œ'}}|
D]'\}}}||f|vr|||f|k(sJŒe|
sŒt%|«Œuy#1swYŒ‚xYwcc}w)a$Test that adding entities and resuming training works passably OK.
There are two issues here:
1) We have to re-add labels. This isn't very nice.
2) There's no way to set the learning rate for the weight update, so we
end up out-of-scale, causing it to learn too fast.
ÚheyÚhowdyz hey thereÚhelloÚhizi'm looking for a place to eatz,i'm looking for a place in the north of town)éé$ÚLOCATIONzshow me chinese restaurants)rGéÚCUISINEzshow me chines restaurants)rGérwÚneréÚentitiesN)rÚadd_pipeÚ add_labelÚ
initializeÚrangeÚrandomÚshuffler
Ú from_dictÚmake_docÚupdaterÚto_diskrrSÚ
start_charÚend_charÚlabel_Ú Exception)Ú
TRAIN_DATArWryÚoffsetsÚstartÚendÚlabelÚitnÚraw_textÚentity_offsetsÚexampleÚ model_dirÚnlp2rXÚentrSs rYÚ
test_issue999r—Yð
ˆ Ø "ˆ
Ø Ø "ˆ
Ø
ˆrˆ
Ø )¨2Ð.Ø 7Ð:NÐ9OÐPØ &Ð);Ð(<Ð=Ø %Ð(:Ð';Ð
€Jô )€CØ
,‰,
€CÛ
ˆˆ7Û!(Ñ ˆEØ M‰M˜ ñ")ð‡NÜRŽyˆÜ"Û(2Ñ $ˆH× ˜°^Ð(Dóˆ
J‰J˜ )3ðô
Œ˜9Ø Ü# IÓ÷
ó%/Ñ ˆ8‹nˆØFIÇhÂhÓOÁh¸s §¡Ð·
±
Ñ:ÀhˆÐOÛ!/Ñ ˆEØsˆ|˜tјU C˜LÑ)¨UÒâ# D›/Ð
"0ñ%/÷
ˆüò PsÃ$FÄ'*FÆF i2cóŒddddddœddddœdd ddœd
d ddœd d
ddœddddœddddœddddœddddœg gdœddddœdd ddœddddœddddœddddœd d!ddœd"d#ddœd$d
ddœd%dddœd&dddœd'dddœd(d)ddœg gdœgd*d+d,œd-d.d,œgd/œd0dd1ddœdd2ddœdd3ddœd
d4ddœd d5ddœdd6ddœdd7ddœdd8ddœdd9ddœddddœg
gdœddddœggdœgd*d.d,œd-d+d,œgd/œgd:œ}t«}gd;¢}t«5}|d<z }t|g«}t||¬=«j «}|j d>«5}|j
|«ddd«t|«}t||««} t| «dk(sJg}
| D]!} |
j| j««Œ#t|
«d k(sJ ddd«y#1swYŒwxYw#1swYyxYw)?NrzRHow should I cook bacon in an oven?
I've heard of people cooking bacon in an oven.ÚHowrB)ÚidÚorthryrDÚshouldrÚIr4Úcookr6Úbaconr8Úinr5Úanr7ÚovenrGú?)ÚtokensÚbracketsé ú
é
é z'veé Úheardé
ÚofrxÚpeoplervÚcookingéééérzr&ÚbakingrH)rÚvalueÚ
not_bakingrI)ÚrawÚ sentencesrVz5What is the difference between white and brown eggs?
ÚWhatr`ÚtheÚ
differenceÚbetweenÚwhiteÚandÚbrownÚeggs©Ú
paragraphs)ÚORTHÚ
SENT_STARTÚENT_IOBÚENT_TYPEztest4402.spacy)ÚdocsÚattrsÚwb) rrrrÚto_bytesÚopenÚwriter ÚlistrTÚextendÚ split_sents) Ú json_datarWÚtmpdirÚ output_filerÇÚdataÚfile_ÚreaderÚ
train_dataÚsplit_train_dataÚegs rYÚtest_issue4402rÙŠððmð$%¨e¸@Ø#$¨h¸CØ#$¨c¸>Ø#$¨f¸AØ#$¨g¸BØ#$¨d¸?Ø#$¨d¸?Ø#$¨f¸AØ#$¨c¸
#ð%'ñ
ð $%¨d¸?Ø#%¨s¸?Ø#%¨u¸AØ#%¨w¸CØ#%¨t¸@Ø#%¨xÀÑDØ#%¨yÀÑEØ#%¨w¸CØ#%¨t¸@Ø#%¨t¸@Ø#%¨v¸BØ#%¨s¸
#ð%'ñð ðDÑ*°SÑñG'
ðRPð$%¨f¸AØ#$¨d¸?Ø#$¨e¸@Ø#$¨lÀ3ÑGØ#$¨iÀÑDØ#$¨g¸BØ#$¨e¸@Ø#$¨g¸BØ#$¨f¸AØ#$¨c¸ #ð%'ñð(*°4ÀÑ DÐEÐSUÑVð!ð&Ñ*°SÑñ)
ðSB
ñE€IôL ‹)€CÚ 9€EÜ Œ˜6ØÐ ܘY˜KÓܘ4 uÔØ
×
Ñ
˜
# uØ K‰K˜Ô ÷˜ ÓÜ™& &ˆ
Ü:‹ !ÒÐÛˆBØ × # B§N¡NÓ$4Õ äÐÒ
ˆ÷
#ú÷
ˆús%Ã(>F:Ä&F.Ä8A,F:Æ.F7 Æ3F:Æ:Ga
[nlp]
lang = "en"
pipeline = ["tok2vec", "tagger"]
[components]
[components.tok2vec]
factory = "tok2vec"
[components.tok2vec.model]
@architectures = "spacy.Tok2Vec.v1"
[components.tok2vec.model.embed]
@architectures = "spacy.MultiHashEmbed.v1"
width = ${components.tok2vec.model.encode:width}
attrs = ["NORM","PREFIX","SUFFIX","SHAPE"]
rows = [5000,2500,2500,2500]
include_static_vectors = false
[components.tok2vec.model.encode]
@architectures = "spacy.MaxoutWindowEncoder.v1"
width = 96
depth = 4
window_size = 1
maxout_pieces = 3
[components.tagger]
factory = "tagger"
[components.tagger.model]
@architectures = "spacy.Tagger.v2"
nO = null
[components.tagger.model.tok2vec]
@architectures = "spacy.Tok2VecListener.v1"
width = ${components.tok2vec.model.encode:width}
upstream = "*"
iuc óx
ddgd¢ifddgd¢ifg}tjtt««}gŠ
|D]<}
j t j |j|d«|d««Œ>|jˆ
fd¬ «}td
«D]}i}|j
||¬ «Œgd ¢}t|j|d¬
««}t|j|d¬
««}|ddD cgc]} | djŒc} |ddD cgc]} | djŒc} k(sJycc} wcc} w)z<Test that an empty document doesn't mess up an entire batch.zI like green eggsrM)ÚJrÛz Eat blue ham)rrDcóSrmrj)Útrain_exampless€rYú<lambda>z test_issue7029.<locals>.<lambda>sø€±NrZ)Ú get_examplesé2©ÚsgdÚlosses)ÚfirstÚsecondÚthirdÚfourthr¾ÚthenÚsomer3)Ú
batch_sizer6Néÿÿÿÿ)rÚ from_configrÚ CONFIG_7029Úappendr
rr~rr„ÚpipeÚtag_) rWÚtÚ optimizerÚiråÚtextsÚdocs1Údocs2rXs @rYÚtest_issue7029rù
s6ø€ð
˜vÒ';И&¢/ЀJô ×
Ñ
Ô2´
@€CØ€NÛ
ˆØ×Ñœg×· ± ¸Q¸q¹TÓ0BÀAÀaÁDÓàÓ,BÓC€IÜ
2ŽYˆØˆØ
> y¸ˆ
Õò
N€EÜ ˜%¨AÓ /€EÜ ˜%¨AÓ /€EØ#(¨¨"¡:Ó .¡:˜CˆC‰FKK  .È%ÐPSÐQSÉ*Ó2UÉ*À3°3°q±6·;³;È*Ñ2UÒ  UùÒ .ùÒ2Us Ã5D2ÄD7cóŠgd¢}gd¢}t|||¬«}td«td«dfg}t||«}|gd¢k(sJy)N)rr r!r%r&)TTTFT©rLrhú
I flew to zI flew to LondonÚLOC©rBrBrBúU-LOCrB©rrTr©Úen_vocabrLrhrXr{rMs rYÚtest_gold_biluo_Ur"sOÚ .€EÚ
,€FÜ
ˆh˜e¨FÔ
3€CÜ"¤CÐ(:Ó$;¸D€HÜ    /€DØ Ò  0rZcóŠgd¢}gd¢}t|||¬«}td«td«dfg}t||«}|gd¢k(sJy)N)rr r!ÚSanÚ Franciscor&)TTTTFTrûúI flew to San Franciscorý)rBrBrBrEúL-LOCrBrrs rYÚtest_gold_biluo_BLr +sOÚ 8€EÚ
2€FÜ
ˆh˜e¨FÔ
3€CÜ"¤CÐ(AÓ$BÀEÐK€HÜ    /€DØ Ò  9rZcóŠgd¢}gd¢}t|||¬«}td«td«dfg}t||«}|gd¢k(sJy)rr r!rrr#r&©TTTTTFTrûúI flew to San Francisco Valleyrý)rBrBrBrErFrrBrrs rYÚtest_gold_biluo_BILr4sOÚ B€EÚ
8€FÜ
ˆh˜e¨FÔ
3€CÜ"¤CÐ(HÓ$IÈ5ÐR€HÜ    /€DØ Ò  BrZcógd¢}gd¢}t|||¬«}td«td«dftd«td«dfg}tjt«5t ||«ddd«y#1swYyxYw)Nr r r
r)rrTÚpytestÚraisesÚ
ValueErrorr)rrLrhrXr{s rYÚtest_gold_biluo_overlapr=soÚ B€EÚ
8€FÜ
ˆh˜e¨FÔ
3€Cä ˆœCÐ @ÓAÀ5Ð ˆ\Ó œCÐ 9Ó:¸EЀHô
”zÕ "ܘc 
#× "Ñ "ús Á
A4Á4A=cóægd¢}gd¢}t|||¬«}td«td«dfg}tjt«5t ||«}ddd«gd¢k(sJy#1swYŒxYw)N)rr r!rrzValley.)TTTTTFrûr
)rBrBrBú-rr)rrTrÚwarnsÚ UserWarningrrs rYÚtest_gold_biluo_misalignrIsiÚ >€EÚ
2€FÜ
ˆh˜e¨FÔ
3€CÜ"¤CÐ(HÓ$IÈ5ÐR€HÜ ”kÕ "Ü$ S¨(Ó3ˆ÷
Ò  
#Ð "ús Á
A'Á'A0có@gd¢}gd¢}|Dcgc]}|jj|«Œ}}t||¬«}t||¬«}|jdt j
|d¬««}t
||«}|jdd¬«}|gd¢k(sJycc}w) N©rÚlikeÚstuff©r.r/r.©rLÚTAGÚuint64)ÚdtypeT©Ú as_string)ÚstringsÚaddrÚ
from_arrayÚnumpyÚarrayr
Ú get_aligned)rrLrMÚtagÚtag_idsÚ predictedÚ referencer“s rYÚtest_example_constructorr.SsÚ "€EÚ #€DÙ48Ó9±D¨SˆÑ×# CÕ(°D€GÐH *€IÜH *€IØ×$ U¬E¯K©K¸ÀxÔ,PÓQ€IÜÓ+€GØ × Ñ ˜u°Ð Ó 5€DØ Ò  +ùò
:s"Bcó–gd¢}gd¢}t||¬«}tj|d|i«}|jdd¬«}|gd¢k(sJy)NrrrÚTAGSrTr")rr
rr))rrLrMr,r“s rYÚtest_example_from_dict_tagsr1_sRÚ "€EÚ #€DÜH *€IÜ×Ñ  ¨F°D¨>Ó:€GØ × Ñ ˜u°Ð Ó 5€DØ Ò  +rZcó’gd¢}gd¢}t|||¬«}tj|d|i«}|j«}|gd¢k(sJy)ÚTTFTrûrL)NNNN©rr
rÚget_aligned_ner©rrLrhr,r“Úner_tagss rYÚtest_example_from_dict_no_nerr=hsNÚ €EÚ
&€FÜH E°&Ô9€IÜ×Ñ  ¨G°UÐ+;Ó<€GØ×(€HØ Ò  /rZcó˜gd¢}gd¢}t|||¬«}tj||gd¢dœ«}|j«}|gd¢k(sJy)Nr3r8)rÿNNN©rLr{r9r;s rYÚtest_example_from_dict_some_nerr@qsTÚ €EÚ
&€FÜH E°&Ô9€IÜ×ÑØ˜UÒ0KÑ€Gð×(€HØ Ò  2rZzignore::UserWarningc óddddddddœdddd dœd
d d d
dœdddddœdddddœgigigdœg}tt|««}t|«dk(sJ|D]}|jd«sŒJD]}|jd k(rŒJt t
|j|Dcgc]}|jŒc}|Dcgc]}t|j«Œc}¬«|«}|j«}|gd¢k(sJycc}wcc}w)NrDr¸Únnr'zMs.)ÚdepÚheadr*rr;ÚHaagr<rÚVBZÚplaysÚdobjríÚEliantir@éþÿÿÿr&)NNNNN) rrTÚhas_annotationÚent_iobr
rrUÚtextÚboolÚ whitespace_r:)rrXÚtokenÚwrØr<s rYÚtest_json_to_docs_no_nerrR|sXðð à$Ø(,°aÀÈuÑ Uà+2Ø,-Ø+0Ø,2ñ !"ð,2Ø,-Ø+0Ø,3ñ !"ð,2Ø,.Ø+0Ø,5ñ !"ð )0¸ÀCÐQTÑ Uð)'ðð"ððñ
ð" €DôF  ˜ #€DÜ ˆt9˜Š>ЈˆØ×% iÕãˆØ}‰} Óä Ü Ø I‰IÙ#&Ó'¡3˜a1—6“6 'Ù14Ó¨A”D˜ŸÕÑ
ð
ó

€Bð×#€HØ Ò  5ùò
(ùÚ5s ÂC6Â3C;có0gd¢}gd¢}gd¢}t||¬«}tj|||dœ«}|jdk(sJ|j «}t |«dk(sJ|djd k(sJ|d
jd k(sJgd ¢}gd
¢}gd¢}t||¬«}tj|||dœ«}|jdk(sJ|j «}t |«dk(sJ|djd k(sJ|d
jd k(sJy)N)rr r!úSan Francisco ValleyÚhadz loads of fun)
rr r!rrr#rUÚloadsr­Úfun)
TFFFFFTFFFr)rLriz0I flew to San Francisco Valley had loads of fun rrzI flew to San Francisco Valley rDzhad loads of fun ) rr r!rrr#rUrVzof fun)rr r!ú
San Franciscor#rUzloads ofrW)TFFFFTFF)rr
rrMrT)rrLÚ
gold_wordsrirXr“Úsplit_exampless rYÚtest_split_sentencesr[³s6â N€EÚ_€JÚV€Kä
ˆh˜
$€CÜ×Ñ ¨zÈ+Ñ%VÓW€GØ <‰<Ð  ×*€NÜ ˆ~Ó  !Ò  ˜!Ñ × !Ð%FÒ  ˜ × !Ð%8Ò  W€EÚY€JÚH€Kä
ˆh˜
$€CÜ×Ñ ¨zÈ+Ñ%VÓW€GØ <‰<Ð  ×*€NÜ ˆ~Ó  !Ò  ˜!Ñ × !Ð%FÒ  ˜ × !Ð%8Ò  8rZcórgd¢}gd¢}t|||¬«}d}t|«t|dz«dfg}gd¢}tj|||dœ«}|j «} | gd ¢k(sJtd
«td «d ft|«t|dz«dfg}gd
¢}tj|||dœ«}|j «} | gd¢k(sJtd
«td«d ft|«t|dz«dfg}gd
¢}tj|||dœ«}|j «} | gd¢k(sJy)N)úMr and ú Mrs Smithúflew torTr&©TTTFFrûúMr and Mrs Smith flew to rT)úMr and Mrs Smithr r!rrr#r&r?r]rbÚPERSON© úMr andÚMrsÚSmithr r!rrr#r&)rBúU-PERSONrBrÿrBú
Mr and Mrs)rBNrBrÿrB©rrTr
rr:©
rÚ en_tokenizerrLrhrXÚprefixr{rYr“r<s
rYÚtest_gold_biluo_one_to_manyrnÎsYÚ L€EÚ
-€FÜ
ˆh˜e¨FÔ
3€CØ
(€FÜV“œc &Ð+AÑ"AÓBÀEÐK€HÚV€JÜ×Ñ ¨zÀxÑ%PÓQ€GØ×(€HØ Ò  
ˆY‹œÐ0°(Ð ˆV”c˜&Ð#9Ñ:¸€Hò
]€Jä×Ñ ¨zÀxÑ%PÓQ€GØ×(€HØ Ò  
ˆY‹œ˜\Ó*¨HÐ ˆV”c˜&Ð#9Ñ:¸€Hò
]€Jä×Ñ ¨zÀxÑ%PÓQ€GØ×(€HØ Ò  5rZcó¨gd¢}gd¢}t|||¬«}d}t|«t|dz«dfg}gd¢}tj|||dœ«}|j «} | gd ¢k(sJtd
«td «d ft|«t|dz«dfg}gd
¢}tj|||dœ«}|j «} gd¢}
| |
k(sJy)Nrd) TTTTTTTFFrûrarT)rbr_rTr&r?) rBrBrBrBrBrErFrrBr]rbrc)rer^r_rTr&) rBrCzL-PERSONrBrBrErFrrBrj) rrlrLrhrXrmr{rYr“r<Úexpecteds rYÚtest_gold_biluo_many_to_onerqðÚ W€EÚ
E€FÜ
ˆh˜e¨FÔ
3€CØ
(€FÜV“œc &Ð+AÑ"AÓBÀEÐK€HÚM€JÜ×Ñ ¨zÀxÑ%PÓQ€GØ×(€HØ Ò  
ˆY‹œÐ0°(Ð ˆV”c˜&Ð#9Ñ:¸€HòQ€JÜ×Ñ ¨zÀxÑ%PÓQ€GØ×(€HÚV€HØ  ÐÑ rZcó¤gd¢}gd¢}t|||¬«}d}t|«t|dz«dfg}gd¢}tj|||dœ«}|j «} | gd ¢k(sJtd
«td «d ft|«t|dz«dfg}gd
¢}tj|||dœ«}|j «} | gd¢k(sJy)N)rirgr r!rXr#r&)TTTTTFFrûrarT)ÚMrz
and Mrs Smithr_rúFrancisco Valleyr&r?©rBrBrBrBrErrBr]rbrc)rer^r_rrtr&)NNrBrBrErrBrjrks
rYÚtest_gold_biluo_misalignedrvÚ Q€EÚ
9€FÜ
ˆh˜e¨FÔ
3€CØ
(€FÜV“œc &Ð+AÑ"AÓBÀEÐK€HÚS€JÜ×Ñ ¨zÀxÑ%PÓQ€GØ×(€HØ Ò  
ˆY‹œÐ0°(Ð ˆV”c˜&Ð#9Ñ:¸€HòT€JÜ×Ñ ¨zÀxÑ%PÓQ€GØ×(€HØ Ò  DrZcóòtgd¢d«\}}t|||¬«}d}t|«t|dz«dfg}gd¢}gd¢}tj||||d œ«} | j «}
|
gd
¢k(sJy) N)rr r!rXr#r&z I flew to San Francisco Valley.rûz I flew to rT)rr ú r!rTr&)TTFTFF©rLrhr{ru)rrrTr
rr:) rrlrLrhrXrmr{rYÚ gold_spacesr“r<s rYÚ%test_gold_biluo_additional_whitespacer{äM€Eˆ6ô ˆh˜e¨FÔ
3€CØ
€FÜV“œc &Ð+AÑ"AÓBÀEÐK€HÚF€JÚ9€KÜ×ÑØ z¨[ÀhÑ
€Gð×(€HØ Ò  BrZcó|d«}gd¢}gd¢}dg}tj||||dœ«}|j«}|gd¢k(sJ|d«}gd¢}gd¢}dg}tj||||dœ«}|j«}|gd ¢k(sJy)
NzI'll return the A54 amount)rú'llÚreturnrºÚ54Úamount)FTTTFTF)ÚMONEYry)rBrBrBrBzU-MONEYrBzI'll return the $54 amount)rr}r~ú$r€r)rBrBrBrBzB-MONEYzL-MONEYrB)r
rr:)rrlrXrYrzr{r“r<s rYÚtest_gold_biluo_4791r„-Ù
Ð
4€CÚC€JÚ?€KØ"€HÜ×ÑØ z¨[ÀhÑ
€Gð×(€HØ Ò 
Ð
4€CÚC€JÚ?€KØ"€HÜ×ÑØ z¨[ÀhÑ
€Gð×(€HØ Ò  FrZcó¨d}gd¢}ddg}||«}t||«}||k(sJt||«}|Dcgc]
}|dsŒ |Œ }}||k(sJycc}w)Nú$I flew to Silicon Valley via London.©rBrBrBrErrBúU-GPErB)é)éé#ÚGPEr)rr )rlrMÚ
biluo_tagsrŒrXÚbiluo_tags_convertedÚoffsets_convertedrs rYÚ'test_roundtrip_offsets_biluo_conversionrCsvØ 1€DÚE€JØ Ð0€GÙ

€CܰgÓØ    -¨c°:ÓÙ(/Ó °3°q³6š¨ÐÐ  Ò  'ùò;s
AÁAcó6|d«}gd¢}t||«}|Dcgc]}|jsŒ|Œ}}t|«dk(sJ|djdk(sJ|djdk(sJ|djdk(sJ|djd k(sJycc}w)
Nr†r‡rrzSilicon ValleyrýrDr%)r rˆrTrM)rlrXrÚspansÚspans rYÚtest_biluo_spansr”OÙ
Ð
>€CÚE€JÜ    0€EÙ 3™ed t§{£{ŠT˜e€EÐ ˆu‹:˜Š?Ј?Ø ‰8=‰=Ð  ‰8?‰?˜eÒ  ‰8=‰=˜HÒ  ‰8?‰?˜eÒ  #ùò
4s
B¯BcóÔgd¢}gd¢}t|||¬«}d}dtd«dft|«t|dz«d fg}gd
¢}tj|||d œ«}|jj
} | D
cgc]}
|
j |
jfŒc}
d d
gk(sJ|j| «} | D
cgc]}
|
j |
jfŒc}
ddgk(sJycc}
wcc}
w)N)rbr r!rTr&r`rarrbrcrT)
rsrfrgr r!rrr#r&r?©rr6©r5)rrD)r4r6) rrTr
rr-rSrÚget_aligned_spans_y2x) rrlrLrhrXrmr{Ú
tokens_refr“Úents_refrÚents_y2xs rYÚtest_aligned_spans_y2xrœ[Ú K€EÚ
-€FÜ
ˆh˜e¨FÔ
3€CØ
(€Fà
ŒCÐ #  ˆV”c˜&Ð#9Ñ:¸€Hò
`€Jä×Ñ ¨zÀxÑ%PÓQ€GØ× Ñ ×%€HÙ,4Ó 5©H SˆSY‰Y˜ŸÒ ¨HÑ 5¸&À&Ð9IÒ  ×,¨XÓ6€HÙ,4Ó 5©H SˆSY‰Y˜ŸÒ ¨HÑ 5¸&À&Ð9IÒ  Iùò 6ùâ 5s Á8C Â6C%có,d}t«}dddœdddœg}|jd«}|j|«||«}|jDcgc]}|j|j
fŒc}dd gk(sJd
}d t
d«dft
|«t
|dz«dfg} gd ¢}
tj||
| d
œ«} | jjDcgc]}|j|j
fŒc}ddgk(sJ| jj} | Dcgc]}|j|j
fŒc}dd gk(sJ| j| «}
|
Dcgc]}|j|j
fŒc}ddgk(sJycc}wcc}wcc}wcc}w)Nz-Mr and Mrs Smith flew to San Francisco Valleyrcrb)rÚpatternrýrTÚ entity_rulerrr—rar)rirgr r!rXr#r?)rr)r6r5) rr|Ú add_patternsrSrrTr
rr-r,Úget_aligned_spans_x2y)rrlrMrWÚpatternsÚrulerrXrrmr{r™r“Ú ents_predÚents_x2ys rYÚtest_aligned_spans_x2yr¦nØ :€DÜ
)€CàÐ'9ÑÐ$:Ñ€Hð
L‰L˜Ó (€EØ ×Ñ Ù
ˆd)€CØ,/¯HªHÓ 5©H SˆSY‰Y˜ŸÒ ¨HÑ 5¸&À&Ð9IÒ 
(€Fà
ŒCÐ #  ˆV”c˜&Ð#9Ñ:¸€HòR€JÜ×Ñ ¨zÀxÑ%PÓQ€GØ,3×,=Ñ,=×,BÒ,BÓ CÑ,B SˆSY‰Y˜ŸÒ Ð,BÑ ÐPVÐGWÒ  ×!×&€IÙ,5Ó 6©I SˆSY‰Y˜ŸÒ ¨IÑ 6¸6À6Ð:JÒ  ×,¨YÓ7€HÙ,4Ó 5©H SˆSY‰Y˜ŸÒ ¨HÑ 5¸&À&Ð9IÒ  Iùò 6ùò Dùò 7ùâ 5sÁFÃFÄF ÅFc ód}t«}||«}|j|«}g}d}|j|jt |«t |dz«d¬««|j|jt |«t |dz«d¬««d}||j
|<t
||«} | jj
|}
|
D cgc]} | j| jfŒc} d d
gk(sJ| j|
d ¬ «} | D cgc]} | j| jfŒc} d gk(sJ| j|
d
¬ «}
|
D cgc]} | j| jfŒc} d d
gk(sJycc} wcc} wcc} w)Nr
rXÚCITY)rrTÚVALLEYÚ overlap_ents)r4r8)r4r5F)Ú
allow_overlapT) rÚ char_spanrTrr
r-rr˜)rrlrMrWrXÚgold_docrrmÚ spans_keyr“Ú
spans_goldrÚspans_y2x_no_overlapÚspans_y2x_overlaps rYÚtest_aligned_spans_y2x_overlapr²ˆØ +€DÜ
)€CÙ
ˆd‹)€Cà|‰|˜DÓ!€HØ €EØ
€FØ ‡L×Ñœ3˜v›;¬¨F°_Ñ,DÓ(EÈVÐÓð
‡L×ÑÜ ˜VÐ&<Ñ=ÀXð ó
ôð
€IØ %€H‡NNÜc˜$€GØ×"×Ñ3€JÙ,6Ó 7©J SˆSY‰Y˜ŸÒ ¨JÑ 7¸FÀFÐ;KÒ  #× %ðÐñ-AÓ AÑ,@ SˆSY‰Y˜ŸÒ Ð,@Ñ AÀfÀXÒ  ×5°jÐPTÐÙ,=Ó >Ñ,= SˆSY‰Y˜ŸÒ Ð,=Ñ >À6È6ÐBRÒ  Rùò 8ùò Bùâ >sÃE=ÄFÅFcó||d«}gd¢}tj|d|i«}|jd«gd¢k(sJy)Nr†)NrBrBrErrBrˆrBr{)rrrr4rDrr4r)r
rr))rlrXrr“s rYÚtest_gold_ner_missing_tagsr´§sCÙ
Ð
>€CÚF€JÜ×Ñ  j°*Ð%=Ó>€GØ × Ñ ˜ )Ò-EÒ  ErZcóZ|d«}gd¢}dgt|«z}tj|||dœ«}|jd¬«\}}|jd¬«\}}|gd¢k(sJ|gd¢k(sJ|d «}d
g}dg}tj|||dœ«}|jd¬«\}}||k(sJ||k(sJt |j
d gdgd gd
g¬
«} t |j
gd¢gd¢gd¢gd¢¬
«}
t| |
«}|jd¬«\}} |dgk(sJ| dgk(sJy)NzHe pretty quickly walks away)r4rr4r4rrC)rPrQT)Ú projectivizeF)r4rr4r4r4ÚConrailrzDouble-Jointedr<)rLrhrQrP)ÚDoublerÚJointed)TTT)Úamodr@r<)rrr)rTr
rÚget_aligned_parserrU) rlrXrPrQr“Ú
proj_headsÚ proj_labelsÚ
nonproj_headsÚnonproj_labelsÚdoc_aÚdoc_bÚ proj_depss rYÚtest_projectivizerîsvÙ
Ð
6€CÚ €EØ ˆ7”S˜“ZÑ €DÜ×Ñ ¨u¸dÑ%CÓD€GØ7ÀTЀJ Ø$+×$=Ñ$=È5Ð$=Ó$QÑ!€M šÒ  šOÒ   
!€CØ
ˆC€EØ ˆ7€DÜ×Ñ ¨u¸dÑ%CÓD€GØ7ÀTЀJ Ø ˜Ò ÐÐ Ø ˜$Ò ÐÐ ô
Ø ‰ Ð+°U°GÀ6À(ÐSTÐRUô
€Eô
Ø ‰ Ú
ô 
€Eôe˜UÓ#€GØ5À4ЀJ Ø ˜$˜Ò ÐÐ Ø ˜˜Ò ÐÑ rZcó²gd¢}gd¢}gd¢}t|«}||k(sJtjt«5t|«ddd«y#1swYyxYw)N)rBrBrErFrBrC)rBrBrErrBrh)rBrBú"rErF)rrrr)Úgood_iobÚ
good_biluoÚbad_iobÚconverted_biluos rYÚtest_iob_to_biluorÊÒsIÚ<€HÚ>€JÚ/€GÜ" ,€OØ ˜Ò  ”zÕ "Ü÷
#× "Ñ "ús ¸ A
Á
AcóH|j}|Dcgc]}|jŒ}}|Dcgc]}|jŒ}}|Dcgc]}|jŒ}}|Dcgc]}t |j
«Œ}}|Dcgc]}|j Œ}}|Dcgc]}|jŒ}}|Dcgc]}|jjŒ} }|j}
|jD cgc]%} | j| j| jfŒ'} } t«5}
t!«}|
dz }t#j$|t'|«g«|
dz }t)|g¬«j+|«t-|«}t/||««}ddd«t1|«t3dD««k(sJ|d}||j4jk(sJ||j4Dcgc]}|jŒc}k(sJ||j4Dcgc]}|jŒc}k(sJ||j4Dcgc]}|jŒc}k(sJ||j4Dcgc]}t |j
«Œc}k(sJ||j4Dcgc]}|j Œc}k(sJ||j4Dcgc]}|jŒc}k(sJ| |j4Dcgc]}|jjŒc}k(sJ| |j4jD cgc]%} | j| j| jfŒ'c} k(sJd|j4jvsJd|j4jvsJ|
d|j4jdk(sJ|
d|j4jdk(sJycc}wcc}wcc}wcc}wcc}wcc}wcc}wcc} w#1swYŒexYwcc}wcc}wcc}wcc}wcc}wcc}wcc}wcc} w)Nzroundtrip.jsonzroundtrip.spacy)c3ó2K|]}t|«Œy­wrm)rT)Ú.0rØs rYú <genexpr>z0test_roundtrip_docs_to_docbin.<locals>.<genexpr>ñsèø€Ð?Ñ-> rœ3˜rŸ7Ñ->ùsrrJrK)rMÚidxròÚpos_ÚstrÚmorphÚlemma_Údep_rDrVrSr†r‡rˆrrÚsrslyÚ
write_jsonr
rr…r rTÚsumr-)rXrMrMrNrOrRrQrPrVÚerSÚ reloaded_nlpÚ json_filerÒÚreloaded_examplesÚreloaded_examples rYÚtest_roundtrip_docs_to_docbinrÝÜØ 8‰8€DÙÓ
™#Qˆ155˜#€CÐ
ÙÓ ™CqˆAF‹F˜C€DÐ ÙÓ
™3aˆ166˜3€CÐ
Ù$'Ó
(¡C˜qŒc!—''l C€FÐ
(Ù #Ó
˜1ˆahh €FÐ