Files
INTUIA/Programa final/spacy/pipeline/__pycache__/edit_tree_lemmatizer.cpython-312.pyc
T

129 lines
22 KiB
Plaintext
Raw Normal View History

2026-03-15 13:27:50 +00:00
Ë
>û gˆ=ãóÌddlmZddlmZddlmZmZmZmZm Z m
Z
m Z m Z ddl
ZddlZddlmZmZmZmZddlmZmZddlmZdd lmZdd
lmZdd lmZdd l m!Z!m"Z"m#Z#dd
l$m%Z%ddl&m'Z'ddl(m)Z)ddl*m+Z+ddl,m-Z-dZ.dZ/e«jae/«dZ1ejdddgge1ddddddidœddi¬ «d!ed"e3ded#e
e3d$e4d%e5d&e4d'e
efd(„«Z6Gd)„d*e-«Z7y)+é)ÚCounter)Úislice)ÚAnyÚCallableÚDictÚIterableÚListÚOptionalÚTupleÚcastN)ÚConfigÚModelÚNumpyOpsÚSequenceCategoricalCrossentropy)ÚFloats2dÚInts2dé)Úutil)ÚErrors)ÚLanguage)ÚDoc)ÚExampleÚvalidate_examplesÚvalidate_get_examples)ÚVocabé)Ú EditTrees)Úvalidate_edit_tree)Úlemmatizer_score)Ú
TrainablePipeé
[model]
@architectures = "spacy.Tagger.v2"
[model.tok2vec]
@architectures = "spacy.HashEmbedCNN.v2"
pretrained_vectors = null
width = 96
depth = 4
embed_size = 2000
window_size = 1
maxout_pieces = 3
subword_features = true
ÚmodelÚtrainable_lemmatizerz token.lemmaÚorthéFz@scorerszspacy.lemmatizer_scorer.v1)r"ÚbackoffÚ
min_tree_freqÚ overwriteÚtop_kÚscorerÚ lemma_accçð?)ÚassignsÚrequiresÚdefault_configÚdefault_score_weightsÚnlpÚnamer&r'r(r)r*c
ó<t|j|||||||¬«S)z*Construct an EditTreeLemmatizer component.©r&r'r(r)r*)ÚEditTreeLemmatizerÚvocab)r1r2r"r&r'r(r)r*s údC:\Users\garci\AppData\Roaming\Python\Python312\site-packages\spacy/pipeline/edit_tree_lemmatizer.pyÚmake_edit_tree_lemmatizerr8*s-ô2 Ø ‰ Ø
Ø ØØØØô ð ócóÎeZdZdZ d,ddddedœdeded ed
eed e d e
d
e dee fdZ de
edeedeeeeffdZde
edeefdZdZdZdZde
efdZedee dffd«Zede
fd«Zedefd«Zdddœd e ge
efd!ee d"eefd#„Z!e"«d$œd%„Z#e"«d$œd&„Z$e"«fd'„Z%e"«fd(„Z&d"efd)„Z'd e ge
effd*„Z(d-d+„Z)y).r5zK
Lemmatizer that lemmatizes each word using a predicted edit tree.
r$r%Frr4r6r"r2r&r'r(r)r*cóú||_||_||_||_||_||_||_t|jj«|_ i|_
dgi|_ ||_ t«|_y)
Construct an edit tree lemmatizer.
backoff (Optional[str]): backoff to use when the predicted edit trees
are not applicable. Must be an attribute of Token or None (leave the
lemma unset).
min_tree_freq (int): prune trees that are applied less than this
frequency in the training data.
overwrite (bool): overwrite existing lemma annotations.
top_k (int): try to apply at most the k most probable edit trees.
ÚlabelsN)r6r"r2r&r'r(r)rÚstringsÚtreesÚ
tree2labelÚcfgr*rÚ numpy_ops) Úselfr6r"r2r&r'r(r)r*s r7Ú__init__zEditTreeLemmatizer.__init__Tsnð.ˆŒ
؈Œ
؈Œ ؈Œ ØÔ،؈Œ
ä˜tŸz™z׌
Ø*,ˆŒà$,¨b >ˆŒØˆŒ Ü!ˆr9ÚexamplesÚscoresÚreturncózt|d«tdd¬«}g}|D}g}t|j|j dd¬««D]b\}}||dk(rd} nB|j
j
|j|«}
|jj|
d «} |j| «Œd|j|«Œ£|||«\} } |jjjj| «r3tt j"j%|j&¬
««t)| «| fS) NzEditTreeLemmatizer.get_lossFéÿÿÿÿ)Ú normalizeÚ
missing_valueÚLEMMAT)Ú as_stringÚr©r2)rrÚzipÚ predictedÚ get_alignedr>ÚaddÚtextr?ÚgetÚappendr"ÚopsÚxpÚisnanÚ
ValueErrorrÚE910Úformatr2Úfloat)
rBrDrEÚ loss_funcÚtruthsÚegÚ eg_truthsrPÚ
gold_lemmaÚlabelÚtree_idÚd_scoresÚlosss
r7Úget_losszEditTreeLemmatizer.get_losszsô ˜(Ð$AÔ3¸eÐSUÔVˆ àˆÛˆB؈IÜ+.Ø ˜bŸn™n¨WÀ˜nÓ,Ñ'˜Ð°rÒ)9Ø"Ÿj™jŸn™n¨Y¯^©^¸H ŸO™O׸Ó;× Ñ  Õ,ð
M‰M˜)Õ ñ# 6¨6Ó2‰ˆ :‰:>‰>× Ñ × "  œVŸ[™[×/°T·Y±YÐ T{˜$r9Údocscó.|jdk(r
|j}n,|jtkr
|j}n |j}t t
|««}td|D««s]t |jd«}|Dcgc](}|jjjd|«Œ*}}t |«|k(sJ|S|jj|«}t |«|k(sJ|||«}t |«|k(sJ|Scc}w)Nrc3ó2K|]}t|«Œy­w©N)Úlen)Ú.0Údocs r7ú <genexpr>z-EditTreeLemmatizer.predict.<locals>.<genexpr>£sèø€Ð,¡t ”3s—8¡tùsr<r)
r)Ú_scores2guesses_top_k_equals_1ÚTOP_K_GUARDRAILÚ_scores2guesses_top_k_greater_1Ú_scores2guesses_top_k_guardrailrkÚlistÚanyr@r"rVÚalloc2iÚpredict)rBrgÚscores2guessesÚn_docsÚn_labelsÚguessesrEs r7rvzEditTreeLemmatizer.predict•Ø :‰:˜Š?Ø@‰NØ
Z‰Zœ?Ò
A‰NàAˆNô”T˜$“Z“ˆÜÑ,¡tÓ˜4Ÿ8™8 .ˆHÙRVÓ$WÑRVÈQ T§Z¡Z§^¡^×%;Ñ%;¸A¸xÕ%HÐRVˆGÐ$WÜw“<  ˆ×# DÓÜ6‹{˜fÒ   vÓÜ7‹|˜vÒˆùò%Xs -Dcóºg}t||«D\}}|jd¬«}|jj|«}g}t |«D]c\}} |j
d||}
|j j|
| j«|j|
«ŒS|jd«Œe|jtj|««ŒË|S)Nr)Úaxisr<rH) rOÚargmaxrAÚasarrayÚ enumerater@r>ÚapplyrSrUÚnpÚarray) rBrgrEr{rmÚ
doc_scoresÚ doc_guessesÚdoc_compat_guessesÚtokenrcs r7roz1EditTreeLemmatizer._scores2guesses_top_k_equals_1¯ØˆÜ" 4¨Ö0‰OˆCØ$×Ð3ˆKØŸ.™.×Ó=ˆKà!#Ð Ü% cžN‘Ÿ(™( ,¨[¸©^Ñ<Ø—::×# G¨U¯Z©ZÓ&×-¨gÕ&×-¨bÕ 
N‰Nœ2Ÿ8™8Ð$6Ó ˆr9có¤g}t|jt|j««}t ||«D]\}}|j
j
|«}g}t|«D]¿\}} t|«D]}
t||j««} |jd| } |jj| | j«|j| «Œ{t!j"t j$«j||| f<Œ|jd«ŒÁ|jt!j&|««Œ|S)Nr<rH)Úminr)rkr<rOrArr€ÚrangeÚintr~r@r>rrSrUrÚfinfoÚfloat32rƒ)
rBrgrEr{r)rmr„r†r‡rˆrzÚ candidateÚcandidate_tree_ids
r7rqz2EditTreeLemmatizer._scores2guesses_top_k_greater_1Às؈ÜD—J‘J¤ D§K¡KÓ 0Ó1ˆÜ" ×0‰OˆCØŸ×
Ó;ˆJØ!#Ð Ü% cžN˜AÜ # J¨q¡M×$8Ñ$8Ó$:Ó ;IØ(,¯©°Ñ(:¸9Ñ(EЗzz×'Ð(9¸5¿:¹:Ó*×1Ð2CÔÜ/1¯x©x¼¿
¹
Ó/C×/GÑ/GJ˜q )˜
'×-¨bÕ
N‰Nœ2Ÿ8™8Ð$6Ó ˆr9cóÞg}t||«D\}}tj|«dd|j dz
df}|jj |«}g}t||«D][\}} d}
| D]>} |j d| } |jj| |j«Œ<| }
n|j|
«Œ]|jtj|««ŒÝ|S)N.rrHr<) rOrÚargsortr)rArr@r>rrSrU)
rBrgrEr{rmr„r…r†rˆÚ
candidatesrcrrs
r7rrz2EditTreeLemmatizer._scores2guesses_top_k_guardrailÔØˆÜ" 4¨Ö0‰OˆCÜŸ*™* ZÓÐ6L¸¿¹¸ Àa¹È"Ð6LÐ1LÑMˆŸ.™.×Ó=ˆKà!#Ð Ü%(¨¨kÖ%:Ñ!Û!+IØ(,¯©°Ñ(:¸9Ñ(EЗzz×'Ð(9¸5¿:¹:ÓRØ"3˜Ùð ",ð #×)¨'Õ&;ð
N‰Nœ2Ÿ8™8Ð$6Ó  1ð"ˆr9cóžt|«D]¿\}}||}t|d«r|j«}t|«D]\}}|js||jdk(sŒ%|dk(r/|j
Œ7t
|||j
«||_ŒY|jj|||j«}|||_
ŒŒÁy)NrTrrH) r€ÚhasattrrTr(Úlemmar&Úgetattrr>rrSÚlemma_) rBrgÚbatch_tree_idsr‡rmÚ doc_tree_idsÚjrcrs r7Úset_annotationsz"EditTreeLemmatizer.set_annotationséÜ –o‰FˆAˆsØ)¨!Ñ,ˆLÜ| +×1 Ü Ö5
—>> ¡V§\¡\°QÓ%6ð "Ÿ<™<Ñ3Ü+2°3°q±6¸4¿<¹<Ó+H˜C ™FLà $§
¡
× 0Ñ 0°¸#¸a¹&¿+¹+Ó F˜Ø(-˜˜A™
ñ&r9.có2t|jd«S)z4Returns the labels currently added to the component.r<)Útupler@©rBs r7r<zEditTreeLemmatizer.labelsúsôT—XX˜(r9cóy)NT©s r7Ú hide_labelszEditTreeLemmatizer.hide_labelsÿsàr9có^g}tt|j««D]h}|j|}d|vr|jj|d|d<d|vr|jj|d|d<|j |«Œjt
|t|jd«¬«S)origÚsubstr<)r>r<) rrkr>r6r=rUÚdictržr@)rBr>rcÚtrees r7Ú
label_datazEditTreeLemmatizer.label_datasàˆÜœS §¡›_Ö-ˆGØ—:‘:˜gÑ&ˆ˜‰~Ø#Ÿz™z×1°$°v±,Ñ?V‘ ؘ$‰Ø $§
¡
× 2Ñ 2°4¸±=Ñ AW‘
Ø L‰L˜Õ ð
˜%¬¨d¯h©h°xÑ.@Ó(AÔBr9N)r1r<Ú get_examplesr1r<c ó°t|d«||j|«n|j|«g}g}t|«d«D}|j |j
«g}|j D]m}|jdk(rd} n&|j|j|j«} |j |jdD
cgc] }
|
| k(rdndŒ
c}
«Œott|«}|j |jjj!|d¬««Œã|j#«t%|«dkDs/Jt&j(j+|j,¬ ««t%|«dkDs/Jt&j(j+|j,¬ ««|jj/||¬
«ycc}
w) NzEditTreeLemmatizer.initializeé
rr<r,g)ÚdtyperN)ÚY)rÚ_labels_from_dataÚ _add_labelsrrUÚxÚ referencerÚ _pair2labelrSr˜r@r rr"rVrÚ_require_labelsrkrÚE923r[r2Ú
initialize) rBr1r<Ú
doc_sampleÚ label_sampleÚexampleÚ gold_labelsrˆÚ
gold_labelrbs r7zEditTreeLemmatizer.initializeô ˜lÐ,KÔ ˆ × "  × Ñ ˜ ˆ
؈ Ü™l›n¨bÖ1ˆGØ × Ñ ˜gŸi™iÔ (Ø-/ˆKØ ×*Ø—;; #Ø!%Jà!%×!1Ñ!1°%·*±*¸e¿l¹lÓ!K×&*§X¡X¨hÒ%7óá%7˜
Ò2™¸Ñ;Ø%7ñõð
œx¨Ó5ˆKØ × Ñ  §
¡
§¡× 6Ñ 6°{È)Ð 6Ó TÕ Uð#2ð&
×ÑÔÜ:‹ ÒF¤F§K¡K×$6Ñ$6¸D¿I¹IÐ$6Ó$FÓ  H¤f§k¡k×&8Ñ&8¸d¿i¹iÐ&8Ó&HÓ
×Ñ 
¨lÐÕ;ùòsÃ