Programa%20final/spacy/pipeline/__pycache__/lemmatizer.cpython-312.pyc

Ë
>û gâ0ãó°—ddlZddlmZddlmZmZmZmZmZm	Z	m
Z
mZddlm
Z
ddlmZddlmZmZddlmZdd	lmZmZdd
lmZddlmZmZddlmZdd
lmZm Z m!Z!ddl"m#Z#ddl$m%Z%ejLddgdddddidœddi¬«dede	e
de'de'de(d e	efd!„«Z)d"eed#ee'effd$„Z*e!jVd«d%„«Z,Gd&„d'e%«Z-y)(éN)ÚPath)ÚAnyÚCallableÚDictÚIterableÚListÚOptionalÚTupleÚUnion)ÚModelé)Úutil)ÚErrorsÚWarnings)ÚLanguage)ÚLookupsÚload_lookups)ÚScorer)ÚDocÚToken)ÚExample)ÚSimpleFrozenListÚloggerÚregistry)ÚVocabé)ÚPipeÚ
lemmatizerztoken.lemmaÚlookupFz@scorerszspacy.lemmatizer_scorer.v1)ÚmodelÚmodeÚ	overwriteÚscorerÚ	lemma_accgð?)ÚassignsÚdefault_configÚdefault_score_weightsÚnlpr Únamer!r"r#có8—t|j|||||¬«S)N©r!r"r#)Ú
LemmatizerÚvocab)r(r r)r!r"r#s      úZC:\Users\garci\AppData\Roaming\Python\Python312\site-packages\spacy/pipeline/lemmatizer.pyÚmake_lemmatizerr/s"€ô&Ø�	‰	�5˜$ T°YÀvôðóÚexamplesÚreturncó0—tj|dfi|¤ŽS)NÚlemma)rÚscore_token_attr)r1Úkwargss  r.Úlemmatizer_scorer7+s€Ü×"Ñ" 8¨WÑ?¸Ñ?Ð?r0có—tS©N)r7©r0r.Úmake_lemmatizer_scorerr;/s€äÐr0cóø—eZdZdZededeeeeeffd„«Z	d%dde	dœde
d	eed
edede
deedd
fd„Zed„«Zdedefd„Z	d&d
d
dœdeegeefdeedeefd„Zej4fdedd
fd„Zdedeefd„Zdedeefd„Zdede
fd„Ze «dœde!ee"fdeefd „Z#e «dœde!ee"fdeeddfd!„Z$e «dœdeede%fd"„Z&e «dœd#e%deeddfd$„Z'y
)'r,z�
    The Lemmatizer supports simple part-of-speech-sensitive suffix rules and
    lookup tables.

    DOCS: https://spacy.io/api/lemmatizer
    r!r2có6—|dk(rdggfS|dk(rdgddgfSggfS)aReturns the lookups configuration settings for a given mode for use
        in Lemmatizer.load_lookups.

        mode (str): The lemmatizer mode.
        RETURNS (Tuple[List[str], List[str]]): The required and optional
            lookup tables for this mode.
        rÚlemma_lookupÚruleÚlemma_rulesÚ	lemma_excÚlemma_indexr:)Úclsr!s  r.Úget_lookups_configzLemmatizer.get_lookups_config<s<€ð�8ÒØ#Ð$ bÐ)Ð)Ø
�VŠ^Ø"�O k°=Ð%AÐBÐBØ�Bˆxˆr0rFr+r-r r)r"r#NcóÀ—||_||_||_||_t	«|_||_d|_|jdk(r|j|_
nv|jdk(r|j|_
nU|j›d�}t||«s)ttjj!|¬««‚t#||«|_
i|_||_y)a&Initialize a Lemmatizer.

        vocab (Vocab): The vocab.
        model (Model): A model (not yet implemented).
        name (str): The component name. Defaults to "lemmatizer".
        mode (str): The lemmatizer mode: "lookup", "rule". Defaults to "lookup".
        overwrite (bool): Whether to overwrite existing lemmas. Defaults to
            `False`.
        scorer (Optional[Callable]): The scoring method. Defaults to
            Scorer.score_token_attr for the attribute "lemma".

        DOCS: https://spacy.io/api/lemmatizer#init
        Frr?Ú
_lemmatize)r!N)r-r r)Ú_moderÚlookupsr"Ú
_validatedr!Úlookup_lemmatizeÚ	lemmatizeÚrule_lemmatizeÚhasattrÚ
ValueErrorrÚE1003ÚformatÚgetattrÚcacher#)Úselfr-r r)r!r"r#Ú	mode_attrs        r.Ú__init__zLemmatizer.__init__Ks¼€ð.ˆŒ
ØˆŒ
ØˆŒ	ØˆŒ
Ü“yˆŒØ"ˆŒØˆŒØ�9‰9˜Ò Ø!×2Ñ2ˆD�NØ
�Y‰Y˜&Ò
 Ø!×0Ñ0ˆD�NàŸ9™9˜+ ZÐ0ˆIÜ˜4 Ô+Ü ¤§¡×!4Ñ!4¸$Ð!4Ó!?Ó@Ð@Ü$ T¨9Ó5ˆDŒNØˆŒ
Øˆ�r0có—|jSr9)rG)rSs r.r!zLemmatizer.modeus€à�z‰zÐr0ÚdoccóN—|js|jtj«|j	«}	|D]7}|j
s|jdk(sŒ|j|«d|_Œ9|S#t$r }||j||g|«Yd}~yd}~wwxYw)z´Apply the lemmatizer to one document.

        doc (Doc): The Doc to process.
        RETURNS (Doc): The processed Doc.

        DOCS: https://spacy.io/api/lemmatizer#call
        rN)rIÚ_validate_tablesrÚE1004Úget_error_handlerr"r4rKÚlemma_Ú	Exceptionr))rSrWÚ
error_handlerÚtokenÚes     r.Ú__call__zLemmatizer.__call__ys�€ð�ŠØ×!Ñ!¤&§,¡,Ô/Ø×.Ñ.Ó0ˆ
ð	5Û�Ø—>’> U§[¡[°AÓ%5Ø#'§>¡>°%Ó#8¸Ñ#;�E•LððˆJøÜò	5Ù˜$Ÿ)™) T¨C¨5°!×4Ñ4ûð	5ús½ A;ÁA;Á;	B$ÂBÂB$)r(rHÚget_examplesr(rHcó¤—|j|j«\}}|€Štjd«t	|j
j|¬«}t	|j
j|d¬«}|jD]#}|j||j|««Œ%||_
|jtj«y)aÑInitialize the lemmatizer and load in data.

        get_examples (Callable[[], Iterable[Example]]): Function that
            returns a representative sample of gold-standard Example objects.
        nlp (Language): The current nlp object the component is part of.
        lookups (Lookups): The lookups object containing the (optional) tables
            such as "lemma_rules", "lemma_index", "lemma_exc" and
            "lemma_lookup". Defaults to None.
        Nz2Lemmatizer: loading tables from spacy-lookups-data)ÚlangÚtablesF)rdreÚstrict)rDr!rÚdebugrr-rdreÚ	set_tableÚ	get_tablerHrYrrZ)rSrbr(rHÚrequired_tablesÚoptional_tablesÚoptional_lookupsÚtables        r.Ú
initializezLemmatizer.initializeŒs¤€ð ,0×+BÑ+BÀ4Ç9Á9Ó+MÑ(ˆ˜Øˆ?Ü�L‰LÐMÔNÜ"¨¯
©
¯©ÀÔPˆGÜ+Ø—Z‘Z—_‘_¨_ÀUô Ðð*×0Ô0�Ø×!Ñ! %Ð)9×)CÑ)CÀEÓ)JÕKð1àˆŒØ×ÑœfŸl™lÕ+r0Ú
error_messagecóî—|j|j«\}}|D]K}||jvsŒt|j	|j||jj
¬««‚d|_y)z8Check that the lookups are correct for the current mode.)r!reÚfoundTN)rDr!rHrNrPrerI)rSrorjrkrms     r.rYzLemmatizer._validate_tables¨sq€à+/×+BÑ+BÀ4Ç9Á9Ó+MÑ(ˆ˜Û$ˆEØ˜DŸL™LÒ(Ü Ø!×(Ñ(Ø!ŸY™YØ.Ø"Ÿl™l×1Ñ1ð)óóðð%ðˆ�r0r_có°—|jjdi«}|j|j|j«}t	|t
«r|g}|S)zÞLemmatize using a lookup-based approach.

        token (Token): The token to lemmatize.
        RETURNS (list): The available lemmas for the string.

        DOCS: https://spacy.io/api/lemmatizer#lookup_lemmatize
        r>)rHriÚgetÚtextÚ
isinstanceÚstr)rSr_Úlookup_tableÚresults    r.rJzLemmatizer.lookup_lemmatize¶sJ€ð—|‘|×-Ñ-¨n¸bÓAˆØ×!Ñ! %§*¡*¨e¯j©jÓ9ˆÜ�fœcÔ"Ø�XˆFØˆ
r0cót—|j|j|jjf}||jvr|j|S|j
}|jj«}|dvr9|dk(r#tjtj«|j«gS|j|«r|j«gS|jjdi«}|jjdi«}|jjdi«}t|j!|«|j!|«|j!|«f«s|dk(r|gS|j«gS|j!|i«}|j!|i«}	|j!|i«}
|}|j«}g}g}
|
D]n\}}|j#|«sŒ|dt%|«t%|«z
|z}|sŒ8||vs|j'«s|j)|«Œ^|
j)|«Œpt+t,j/|««}|	j!|g«D]}||vsŒ|j1d|«Œ|s|j3|
«|s|j)|«||j|<|S)	zÚLemmatize using a rule-based approach.

        token (Token): The token to lemmatize.
        RETURNS (list): The available lemmas for the string.

        DOCS: https://spacy.io/api/lemmatizer#rule_lemmatize
        )ÚÚeolÚspacerzrBrAr@ÚpropnNr)ÚorthÚposÚmorphÚkeyrRrtÚpos_ÚlowerÚwarningsÚwarnrÚW108Úis_base_formrHriÚanyrsÚendswithÚlenÚisalphaÚappendÚlistÚdictÚfromkeysÚinsertÚextend)rSr_Ú	cache_keyÚstringÚuniv_posÚindex_tableÚ	exc_tableÚrules_tableÚindexÚ
exceptionsÚrulesÚorigÚformsÚ	oov_formsÚoldÚnewÚforms                 r.rLzLemmatizer.rule_lemmatizeÄs\€ð—Z‘Z §¡¨E¯K©K¯O©OÐ<ˆ	Ø˜Ÿ
™
Ñ"Ø—:‘:˜iÑ(Ð(Ø—‘ˆØ—:‘:×#Ñ#Ó%ˆØÐ+Ñ+Ø˜2Š~Ü—
‘
œhŸm™mÔ,Ø—L‘L“NÐ#Ð#à×Ñ˜UÔ#Ø—L‘L“NÐ#Ð#Ø—l‘l×,Ñ,¨]¸BÓ?ˆØ—L‘L×*Ñ*¨;¸Ó;ˆ	Ø—l‘l×,Ñ,¨]¸BÓ?ˆÜà—‘ Ó)Ø—
‘
˜hÓ'Ø—‘ Ó)ð
ô
ð˜7Ò"Ø�x�àŸ™›Ð'Ð'à—‘ ¨"Ó-ˆØ—]‘] 8¨RÓ0ˆ
Ø—‘ ¨"Ó-ˆØˆØ—‘“ˆØˆØˆ	Û‰HˆC�Ø�‰˜sÕ#ØÐ6¤ F£¬c°#«hÑ 6Ð7¸#Ñ=�ÙØØ˜U‘]¨$¯,©,¬.Ø—L‘L Õ&à×$Ñ$ TÕ*ðô”T—]‘] 5Ó)Ó*ˆð
—N‘N 6¨2Ö.ˆDØ˜5Ò Ø—‘˜Q Õ%ð/ñØ�L‰L˜Ô#ÙØ�L‰L˜ÔØ %ˆ�
‰
�9ÑØˆr0có—y)aCheck whether the token is a base form that does not need further
        analysis for lemmatization.

        token (Token): The token.
        RETURNS (bool): Whether the token is a base form.

        DOCS: https://spacy.io/api/lemmatizer#is_base_form
        Fr:)rSr_s  r.r‡zLemmatizer.is_base_forms€ðr0©ÚexcludeÚpathr£có\‡‡—i}ˆˆfd„|d<ˆfd„|d<tj||‰«y)zÞSerialize the pipe to disk.

        path (str / Path): Path to a directory.
        exclude (Iterable[str]): String names of serialization fields to exclude.

        DOCS: https://spacy.io/api/lemmatizer#to_disk
        có>•—‰jj|‰¬«S©Nr¢)r-Úto_disk©Úpr£rSs €€r.ú<lambda>z$Lemmatizer.to_disk.<locals>.<lambda>sø€ t§z¡z×'9Ñ'9¸!ÀWÐ'9Ô'Mr0r-có:•—‰jj|«Sr9)rHr¨©rªrSs €r.r«z$Lemmatizer.to_disk.<locals>.<lambda>sø€¨¯©×)=Ñ)=¸aÔ)@r0rHN)rr¨)rSr¤r£Ú	serializes` ` r.r¨zLemmatizer.to_disks.ù€ðˆ	ÜMˆ	�'ÑÛ@ˆ	�)ÑÜ�‰�T˜9 gÕ.r0có~‡‡—i}ˆˆfd„|d<ˆfd„|d<tj||‰«‰j«‰S)aHLoad the pipe from disk. Modifies the object in place and returns it.

        path (str / Path): Path to a directory.
        exclude (Iterable[str]): String names of serialization fields to exclude.
        RETURNS (Lemmatizer): The modified Lemmatizer object.

        DOCS: https://spacy.io/api/lemmatizer#from_disk
        có>•—‰jj|‰¬«Sr§)r-Ú	from_diskr©s €€r.r«z&Lemmatizer.from_disk.<locals>.<lambda>-sø€¨¯©×)=Ñ)=¸aÈÐ)=Ô)Qr0r-có:•—‰jj|«Sr9)rHr±rs €r.r«z&Lemmatizer.from_disk.<locals>.<lambda>.sø€¨4¯<©<×+AÑ+AÀ!Ô+Dr0rH)rr±rY)rSr¤r£Údeserializes` ` r.r±zLemmatizer.from_disk!s?ù€ð8:ˆÜQˆ�GÑÛ!Dˆ�IÑÜ�‰�t˜[¨'Ô2Ø×ÑÔØˆr0cóz‡‡—i}ˆˆfd„|d<‰jj|d<tj|‰«S)zçSerialize the pipe to a bytestring.

        exclude (Iterable[str]): String names of serialization fields to exclude.
        RETURNS (bytes): The serialized object.

        DOCS: https://spacy.io/api/lemmatizer#to_bytes
        có<•—‰jj‰¬«Sr§)r-Úto_bytes)r£rSs€€r.r«z%Lemmatizer.to_bytes.<locals>.<lambda><sø€ T§Z¡Z×%8Ñ%8ÀÐ%8Ô%Ir0r-rH)rHr¶r)rSr£r®s`` r.r¶zLemmatizer.to_bytes3s9ù€ðˆ	ÜIˆ	�'ÑØ#Ÿ|™|×4Ñ4ˆ	�)ÑÜ�}‰}˜Y¨Ó0Ð0r0Ú