o 0¦iÌRã@sÜ ddlZddlZddlmZdgZe d¡Ze d¡Ze d¡Ze d¡Z e d¡Z e d ¡Z e d ¡Z e d ¡Z e d ¡Ze d ¡Ze dej¡Ze dej¡Ze dej¡Ze d ¡Ze d¡ZGdd„dejƒZdS)éN)ÚunescapeÚ HTMLParserz[&<]z &[a-zA-Z#]z%&([a-zA-Z][-.a-zA-Z0-9]*)[^a-zA-Z0-9]z)&#(?:[0-9]+|[xX][0-9a-fA-F]+)[^0-9a-fA-F]z <[a-zA-Z]z z--!?>z-?>z0([a-zA-Z][^\t\n\r\f />]*)(?:[\t\n\r\f ]|/(?!>))*a{ ( (?<=['"\t\n\r\f /])[^\t\n\r\f />][^\t\n\r\f /=>]* # attribute name ) ([\t\n\r\f ]*=[\t\n\r\f ]* # value indicator ('[^']*' # LITA-enclosed value |"[^"]*" # LIT-enclosed value |(?!['"])[^>\t\n\r\f ]* # bare value ) )? (?:[\t\n\r\f ]|/(?!>))* # possibly followed by a space a [a-zA-Z][^\t\n\r\f />]* # tag name [\t\n\r\f /]* # optional whitespace before attribute name (?:(?<=['"\t\n\r\f /])[^\t\n\r\f />][^\t\n\r\f /=>]* # attribute name (?:[\t\n\r\f ]*=[\t\n\r\f ]* # value indicator (?:'[^']*' # LITA-enclosed value |"[^"]*" # LIT-enclosed value |(?!['"])[^>\t\n\r\f ]* # bare value ) )? [\t\n\r\f /]* # possibly followed by a space )* >? aF <[a-zA-Z][^\t\n\r\f />\x00]* # tag name (?:[\s/]* # optional whitespace before attribute name (?:(?<=['"\s/])[^\s/>][^\s/=>]* # attribute name (?:\s*=+\s* # value indicator (?:'[^']*' # LITA-enclosed value |"[^"]*" # LIT-enclosed value |(?!['"])[^>\s]* # bare value ) \s* # possibly followed by a space )?(?:\s|/(?!>))* )* )? \s* # trailing whitespace z#c@sþeZdZ dZdZdddœdd„Zdd „Zd d „Zd d „ZdZ dd„Z ddœdd„Z dd„Z d=dd„Z dd„Zdd„Zd=dd„Zd>dd „Zd!d"„Zd#d$„Zd%d&„Zd'd(„Zd)d*„Zd+d,„Zd-d.„Zd/d0„Zd1d2„Zd3d4„Zd5d6„Zd7d8„Zd9d:„Zd;d<„ZdS)?r)ZscriptZstyleZxmpZiframeZnoembedZnoframes)ZtextareaÚtitleTF)Úconvert_charrefsÚ scriptingcCs ||_||_| ¡dS©N)rrÚreset)Úselfrr©r ú2/opt/alt/python310/lib64/python3.10/html/parser.pyÚ__init__vs  zHTMLParser.__init__cCs6 d|_d|_t|_d|_d|_d|_tj  |¡dS)NÚz???T) ÚrawdataÚlasttagÚinteresting_normalÚ interestingÚ cdata_elemÚ_support_cdataÚ _escapableÚ _markupbaseÚ ParserBaser ©r r r r r „szHTMLParser.resetcCs |j||_| d¡dS)Nr)rÚgoahead©r Údatar r r ÚfeedŽs zHTMLParser.feedcCs | d¡dS)Né)rrr r r Úclose—szHTMLParser.closeNcCs |jSr)Ú_HTMLParser__starttag_textrr r r Úget_starttag_textszHTMLParser.get_starttag_text©Ú escapablecCst| ¡|_||_|jdkrt d¡|_dS|r*|js*t d|jtjtjB¡|_dSt d|jtjtjB¡|_dS)NÚ plaintextz\Zz&|])z])) ÚlowerrrÚreÚcompilerrÚ IGNORECASEÚASCII)r Úelemr"r r r Úset_cdata_mode¡s      ÿ   ÿzHTMLParser.set_cdata_modecCst|_d|_d|_dS)NT)rrrrrr r r Úclear_cdata_mode­s zHTMLParser.clear_cdata_modecCs ||_dSr)r)r Úflagr r r Ú_set_support_cdata²s zHTMLParser._set_support_cdatac Cs|j}d}t|ƒ}||krU|jr;|js;| d|¡}|dkr:| dt||dƒ¡}|dkr8t d¡  ||¡s8n|}n|j   ||¡}|rI|  ¡}n|jrNn|}||kro|jrf|j rf|  t|||…ƒ¡n |  |||…¡| ||¡}||kr{nÚ|j}|d|ƒrt ||¡r| |¡} n@|d|ƒr›| |¡} n5|d|ƒr¦| |¡} n*|d|ƒr±| |¡} n|d |ƒr¼| |¡} n|d |ksÄ|rÎ|  d¡|d } nn…| dkr‰|sÙn|t ||¡ràn§|d|ƒr|d |krò|  d¡n•t ||¡rùnŽ| ||d d…¡n‚|d|ƒr0|}d D]} | | |d ¡r"|t| ƒ8}nq| ||d |…¡nW|d|ƒrF|jrF| ||dd…¡nA|||d… ¡dkr_| ||d d…¡n(|d |ƒrq| ||d d…¡n|d|ƒrƒ| ||d d…¡nt dƒ‚|} | || ¡}nÁ|d|ƒràt! ||¡}|rÃ| "¡d d…} | #| ¡| $¡} |d| d ƒs¼| d } | || ¡}q d||d…vrß|  |||d …¡| ||d ¡}nu|d|ƒrQt% ||¡}|r| "d ¡} | &| ¡| $¡} |d| d ƒs | d } | || ¡}q t' ||¡}|r;|r:| "¡||d…kr:| $¡} | |kr2|} | ||d ¡}n|d |krP|  d¡| ||d ¡}nn||ks|r€||kr€|jrq|j rq|  t|||…ƒ¡n |  |||…¡| ||¡}||d…|_dS)Nrú<ú&é"z [\t\n\r\f ;]zrr9r6r8rr2rzÚ rhZnoscriptr#Fr!T)rÚcheck_for_whole_start_tagrÚtagfind_tolerantrDrTrRr$rÚattrfind_tolerantrÚappendÚstripZgetposÚcountr;r=rArLÚhandle_startendtagÚhandle_starttagÚCDATA_CONTENT_ELEMENTSrr*ÚRCDATA_CONTENT_ELEMENTS)r rXÚendposrÚattrsrDr[ÚtagÚmÚattrnameÚrestZ attrvaluerTÚlinenoÚoffsetr r r rE—sd  &( ó   ÿ   ù ÿ þzHTMLParser.parse_starttagcCs6|j}t ||d¡}| ¡}||ddkrdS|S)Nrrr9)rÚ locatetagendrDrTrer r r rjÐs z$HTMLParser.check_for_whole_start_tagcCs´|j}| d|d¡dkrdSt ||¡s*||d|d…dkr%|dS| |¡St ||d¡}| ¡}||ddkr@dSt ||d¡}| d¡  ¡}|  |¡|  ¡|S)Nrr2rr9r6r) rr<rJrDr_r|rTrkrRr$Ú handle_endtagr+)r rXrrDrZrvr r r rFÚs    zHTMLParser.parse_endtagcCs| ||¡| |¡dSr)rqr}©r rvrur r r rpøs zHTMLParser.handle_startendtagcCódSrr r~r r r rqýózHTMLParser.handle_starttagcCrrr )r rvr r r r}r€zHTMLParser.handle_endtagcCrrr ©r r]r r r rSr€zHTMLParser.handle_charrefcCrrr rr r r rV r€zHTMLParser.handle_entityrefcCrrr rr r r rA r€zHTMLParser.handle_datacCrrr rr r r rKr€zHTMLParser.handle_commentcCrrr )r Zdeclr r r rNr€zHTMLParser.handle_declcCrrr rr r r rOr€zHTMLParser.handle_picCrrr rr r r rMr€zHTMLParser.unknown_decl)T)r) Ú__name__Ú __module__Ú __qualname__rrrsr r rrrr r*r+r-rrIrGr_rHrErjrFrprqr}rSrVrArKrNrOrMr r r r rZs@    "   9  )r%rZhtmlrÚ__all__r&rrWrUrQrCrJrdr`rarkÚVERBOSErlr|Zlocatestarttagend_tolerantZ endendtagZ endtagfindrrr r r r Ús6             õ  óò