Programa%20final/spacy/training/__pycache__/iob_utils.cpython-312.pyc

Ë
?û gc$ãóÚ—ddlZddlmZmZmZmZmZmZmZddl	m
Z
mZddlm
Z
mZdeedeefd„Zdeedeefd	„Zdeedeefd
„Zdeedeefd„Zdde
d
efd„Zde
deefd„Z	dde
deeeeeeeffd
edeefd„Zde
deedeefd„Zde
deedeeeeeeefffd„Zdeedeeeeeffd„Zdedeeeffd„Zdedefd„ZeZeZeZy)éN)ÚDictÚIterableÚIteratorÚListÚTupleÚUnionÚcasté)ÚErrorsÚWarnings)ÚDocÚSpanÚtagsÚreturncó’—g}t|«}|r7|jt|««|jt|««|rŒ7|S)N)ÚlistÚextendÚ_consume_osÚ_consume_ent)rÚouts  úYC:\Users\garci\AppData\Roaming\Python\Python312\site-packages\spacy/training/iob_utils.pyÚiob_to_biluorsA€Ø€CÜ�‹:€DÙ
Ø�
‰
”;˜tÓ$Ô%Ø�
‰
”< Ó%Ô&òð€Jócóª—g}|D]K}|€|j|«Œ|jddd«jddd«}|j|«ŒM|S)NúU-úB-éúL-úI-)ÚappendÚreplace)rrÚtags   rÚbiluo_to_iobr#sT€Ø
€CÛˆØˆ;Ø�J‰J�s�Oà—+‘+˜d D¨!Ó,×4Ñ4°T¸4ÀÓCˆCØ�J‰J�s�Oðð€Jrc#óbK—|r)|ddk(r |jd«–—|r
|ddk(rŒyyyyw)NrÚO)Úpop)rs rrrs6èø€Ù
�4˜‘7˜c’>Ø�h‰h�q‹kÒñ�4˜‘7˜c”>ˆ$�>ˆ$ùs‚(/«/có¨—|sgS|jd«}d|ddz}d|ddz}d}|r+|d||hvr"|dz
}|jd«|r
|d||hvrŒ"|dd}|dk(r=t|«dk(r)ttjj|¬««‚d|zgSd|z}d	|z}t
d|dz
«D�cgc]}d
|›�‘Œ	}	}|g|	z|gzScc}w)NrÚIrÚLr
©r"rrrr)r&ÚlenÚ
ValueErrorrÚE177ÚformatÚrange)
rr"Ú	target_inÚtarget_lastÚlengthÚlabelÚstartÚendÚ_Úmiddles
          rrr!s€ÙØˆ	Ø
�(‰(�1‹+€CØ�c˜!˜"�g‘
€IØ˜˜A˜B˜‘-€KØ
€FÙ
�4˜‘7˜y¨+Ð6Ñ6Ø�!‰ˆØ�‰�Œñ�4˜‘7˜y¨+Ð6Ò6ð
��ˆG€EØ
�‚{Üˆu‹:˜Š?ÜœVŸ[™[×/Ñ/°CÐ/Ó8Ó9Ð9Ø�u‘ˆ~Ðà�u‘ˆØ�U‰lˆÜ(-¨a°¸!±Ô(<Ó=Ñ(< 1�B�u�g’,Ð(<ˆÐ=Øˆw˜Ñ 3 %Ñ'Ð'ùò>sÂ7CÚdocÚmissingc	óš—t||jD�cgc]%}|j|j|jf‘Œ'c}|¬«Scc}w)N©r9)Úoffsets_to_biluo_tagsÚentsÚ
start_charÚend_charÚlabel_)r8r9Úents   rÚdoc_to_biluo_tagsrB7sA€Ü ØØ?B¿xºxÓH¹x¸ˆ#�.‰.˜#Ÿ,™,¨¯
©
Ò	3¸xÑHØôðùâHs•*A
cóp—t|d¬«}t|«D]\}}|jdk(sŒd||<Œ|S)Nú-r;r
r%)rBÚ	enumerateÚent_iob)r8r=ÚiÚtokens    rÚ_doc_to_biluo_tags_with_partialrI?s<€Ü˜S¨#Ô.€DÜ˜c–N‰ˆˆ5Ø�=‰=˜AÓØˆD�ŠGð#ð€KrÚentitiesc
ó
—i}|D�cic]}|j|j“Œ}}|D�cic]%}|jt|«z|j“Œ'}}|D�cgc]}d‘Œ}}|D]ó\}	}
}|s|D]}||	k\sŒ	||
ksŒd|||<ŒŒ%t|	|
«D]^}
|
|j	«vrBttjj||
d||
d||
df|	|
|f¬««‚|	|
|f||
<Œ`|j|	«}|j|
«}|€Œ·|€Œº||k(r	d|›�||<ŒÈd	|›�||<t|dz|«D]
}d
|›�||<Œd|›�||<Œõt«}|D](\}	}
}t|	|
«D]}|j|«ŒŒ*|D]H}t|j|jt|«z«D]}||vsŒŒ9|||j<ŒJd|vrŽ|dk7r‰t|«}tjtj jt|j"«dkDr|j"ddd
zn|j"t|«dkDr|ddd
zn|¬««|Scc}wcc}wcc}w)u¸Encode labelled spans into per-token tags, using the
    Begin/In/Last/Unit/Out scheme (BILUO).

    doc (Doc): The document that the entity offsets refer to. The output tags
        will refer to the token boundaries within the document.
    entities (iterable): A sequence of `(start, end, label)` triples. `start`
        and `end` should be character-offset integers denoting the slice into
        the original string.
    missing (str): The label used for missing values, e.g. if tokenization
        doesnâ€™t align with the entity offsets. Defaults to "O".
    RETURNS (list): A list of unicode strings, describing the tags. Each tag
        string will be of the form either "", "O" or "{action}-{label}", where
        action is one of "B", "I", "L", "U". The missing label is used where the
        entity offsets don't align with the tokenization in the `Doc` object.
        The training algorithm will view these as missing values. "O" denotes a
        non-entity token. "B" denotes the beginning of a multi-token entity,
        "I" the inside of an entity of three or more tokens, and "L" the end
        of an entity of two or more tokens. "U" denotes a single-token entity.

    EXAMPLE:
        >>> text = 'I like London.'
        >>> entities = [(len('I like '), len('I like London'), 'LOC')]
        >>> doc = nlp.tokenizer(text)
        >>> tags = offsets_to_biluo_tags(doc, entities)
        >>> assert tags == ["O", "O", 'U-LOC', "O"]
    rDr%rrr
)Úspan1Úspan2Nrrrré2z...)ÚtextrJ)ÚidxrGr+r/Úkeysr,rÚE103r.ÚgetÚsetÚaddÚstrÚwarningsÚwarnrÚW030rO)r8rJr9Útokens_in_entsrHÚstartsÚendsr6Úbiluor>r?r3ÚsÚtoken_indexÚstart_tokenÚ	end_tokenrGÚentity_charsÚent_strs                   rr<r<GsÌ€ð<CE€NÙ.1Ó
2©c Uˆe�i‰i˜Ÿ™Ñ ¨c€FÐ
2Ù9<Ó=¹°ˆE�I‰Iœ˜E›
Ñ" E§G¡GÑ+¸€DÐ=ÙÓ™#�QŠS˜#€EÐã'/Ñ#ˆ
�H˜eÙÛ�Ø˜
“? q¨8£|Ø'*�E˜& ™)Ò$ñô % Z°Ö:�Ø .×"5Ñ"5Ó"7Ñ7Ü$ÜŸ™×*Ñ*à .¨{Ñ ;¸AÑ >Ø .¨{Ñ ;¸AÑ >Ø .¨{Ñ ;¸AÑ >ð#ð
$.¨x¸Ð"?ð
+óó	ð	ð0:¸8ÀUÐ.K�˜{Ò+ð ;ð!Ÿ*™* ZÓ0ˆKØŸ™ Ó*ˆIàÑ&¨9Ñ+@Ø )Ò+Ø+-¨e¨W¨�E˜+Ò&à+-¨e¨W¨�E˜+Ñ&Ü" ;°¡?°IÖ>˜Ø%'¨ w <˜˜ašð?à)+¨E¨7 |�E˜)Ò$ð;(0ô>“5€LÛ'/Ñ#ˆ
�H˜eÜ�z 8Ö,ˆAØ×Ñ˜QÕñ-ð(0óˆÜ�u—y‘y %§)¡)¬c°%«jÑ"8Ö9ˆAØ�LÒ Ùð:ð%ˆE�%—'‘'ŠNððˆe�|˜ 3šÜ�h“-ˆÜ�
‰
Ü�M‰M× Ñ Ü.1°#·(±(«m¸bÒ.@�S—X‘X˜c˜r�] UÒ*ÀcÇhÁhÜ14°W³ÀÒ1B˜  "˜¨Ò-Èð
!ó
ô