www.ravenbrook.com - util.cpython-36.pyc

3

V'íc#ã@s8dZddlZddlmZejdZejdejejBejBZ	ejdej
ejBejBZejdejZ
Gdd	d	eZGd
ddeZd1d
dZd2ddZd3ddZd4ddZddZddZddZddZddZiZdd Zd!d"Zd5d#d$Zffd%d&ZGd'd(d(Z d)d*Z!d+d,Z"d-d.Z#Gd/d0d0eZ$dS)6z±
    pygments.util
    ~~~~~~~~~~~~~

    Utility functions.

    :copyright: Copyright 2006-2022 by the Pygments team, see AUTHORS.
    :license: BSD, see LICENSE for details.
éN)Ú
TextIOWrapperz[/\\ ]z
    <!DOCTYPE\s+(
     [a-zA-Z_][a-zA-Z0-9]*
     (?: \s+      # optional in HTML5
     [a-zA-Z_][a-zA-Z0-9]*\s+
     "[^"]*")?
     )
     [^>]*>
z<(.+?)(\s.*?)?>.*?</.+?>z\s*<\?xml[^>]*\?>c@seZdZdZdS)Ú
ClassNotFoundzCRaised if one of the lookup functions didn't find a matching class.N)Ú__name__Ú
__module__Ú__qualname__Ú__doc__©rrú1/tmp/pip-build-gk9425m9/Pygments/pygments/util.pyrsrc@seZdZdS)ÚOptionErrorN)rrrrrrr	r
"sr
FcCs@|j||}|r|j}||kr<td|djtt|f|S)Nz%Value for option %s must be one of %sz, )ÚgetÚlowerr
ÚjoinÚmapÚstr)ÚoptionsÚoptnameÚallowedÚdefaultÚnormcaseÚstringrrr	Úget_choice_opt&srcCs||j||}t|tr|St|tr,t|St|tsHtd||fn0|jd
krXdS|jdkrhdStd||fdS)NzBInvalid type %r for option %s; use 1/0, yes/no, true/false, on/offÚ1ÚyesÚtrueÚonTÚ0ÚnoÚfalseÚoffFzCInvalid value %r for option %s; use 1/0, yes/no, true/false, on/off)rrrr)rrrr)rÚ
isinstanceÚboolÚintrr
r)rrrrrrr	Úget_bool_opt0s


r"cCs`|j||}yt|Stk
r8td||fYn$tk
rZtd||fYnXdS)Nz=Invalid type %r for option %s; you must give an integer valuez>Invalid value %r for option %s; you must give an integer value)rr!Ú	TypeErrorr
Ú
ValueError)rrrrrrr	Úget_int_optDsr%cCsH|j||}t|tr|jSt|ttfr4t|Std||fdS)Nz9Invalid type %r for option %s; you must give a list value)rrrÚsplitÚlistÚtupler
)rrrÚvalrrr	Úget_list_optRs
r*cCsR|js
dSg}x4|jjjD]"}|jr>|jd|jqPqWdj|jS)NÚú )rÚstripÚ
splitlinesÚappendr
Úlstrip)ÚobjÚresÚlinerrr	Údocstring_headline^sr4csfdd}j|_t|S)zAReturn a static text analyser function that returns float values.cs\y|}Wntk
r dSX|s*dSytdtdt|Sttfk
rVdSXdS)Nggð?)Ú	ExceptionÚminÚmaxÚfloatr$r#)ÚtextÚrv)Úfrr	Útext_analyselsz%make_analysator.<locals>.text_analyse)rÚstaticmethod)r;r<r)r;r	Úmake_analysatorjsr>cCs|jd}|dkr$|d|j}n|j}|jdry(ddtj|ddjDd}Wntk
rrd	SXtjd
|tj	}|j
|dk	rdSd	S)
aòCheck if the given regular expression matches the last part of the
    shebang if one exists.

        >>> from pygments.util import shebang_matches
        >>> shebang_matches('#!/usr/bin/env python', r'python(2\.\d)?')
        True
        >>> shebang_matches('#!/usr/bin/python2.4', r'python(2\.\d)?')
        True
        >>> shebang_matches('#!/usr/bin/python-ruby', r'python(2\.\d)?')
        False
        >>> shebang_matches('#!/usr/bin/python/ruby', r'python(2\.\d)?')
        False
        >>> shebang_matches('#!/usr/bin/startsomethingwith python',
        ...                 r'python(2\.\d)?')
        True

    It also checks for common Windows executable file extensions::

        >>> shebang_matches('#!C:\\Python2.4\\Python.exe', r'python(2\.\d)?')
        True

    Parameters (``'-f'`` or ``'--foo'``) are ignored, so ``'perl'`` matches
    the same as ``'perl -e'``.

    Note that this method automatically searches the whole string (i.e.
    the regular expression is wrapped in ``'^$'``).
    Ú
rNz#!cSs g|]}|r|jdr|qS)ú-)Ú
startswith)Ú.0Úxrrr	ú
<listcomp>sz#shebang_matches.<locals>.<listcomp>ééFz^%s(\.(exe|cmd|bat|bin))?$Téÿÿÿÿ)ÚfindrrAÚ
split_path_rer&r-Ú
IndexErrorÚreÚcompileÚ
IGNORECASEÚsearch)r9ÚregexÚindexÚ
first_lineÚfoundrrr	Úshebang_matches{s


rScCs<tj|}|dkrdS|jd}tj|tjj|jdk	S)zÁCheck if the doctype matches a regular expression (if present).

    Note that this method only checks the first part of a DOCTYPE,
    e.g. 'html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN"'.
    NFrF)Údoctype_lookup_rerNÚgrouprKrLÚIÚmatchr-)r9rOÚmÚdoctyperrr	Údoctype_matches¨s


rZcCs
t|dS)z3Check if the file looks like it has a html doctype.Úhtml)rZ)r9rrr	Úhtml_doctype_matchesµsr\cCsltj|rdSt|}yt|Stk
rftj|}|dk	rDdStj|dddk	}|t|<|SXdS)z2Check if a doctype exists or if we have some tags.TNiè)Úxml_decl_rerWÚhashÚ_looks_like_xml_cacheÚKeyErrorrTrNÚtag_re)r9ÚkeyrXr:rrr	Úlooks_like_xml½s

rccCsd|d?d|d@fS)zoGiven a unicode character code with length greater than 16 bits,
    return the corresponding pair of 16-bit surrogates.
    iÀ×é
iÜiÿr)Úcrrr	Ú
surrogatepairÍsrfc	Cs¬g}d|d}d|dd}|j||d|rXx\|D]}|j||dq<Wn<x:|D]2}t|d}|j||dd|ddq^W|j|d	d
j|S)
z)Formats a sequence of strings for output.r,érFz = (ú,ú"NrEú)r?éþÿÿÿrG)r/Úreprr
)	Úvar_nameÚseqÚrawÚindent_levelÚlinesZbase_indentZinner_indentÚiÚrrrr	Úformat_linesÖs

&rtcCsBg}t}x2|D]*}||ks||kr&q|j||j|qW|S)za
    Returns a list with duplicates removed from the iterable `it`.

    Order is preserved.
    )Úsetr/Úadd)ÚitZalready_seenÚlstÚseenrrrrr	Úduplicates_removedés

rzc@seZdZdZddZdS)ÚFuturezGeneric class to defer some work.

    Handled specially in RegexLexerMeta, to support regex string construction at
    first use.
    cCstdS)N)ÚNotImplementedError)Úselfrrr	rÿsz
Future.getN)rrrrrrrrr	r{ùsr{cCsty|jd}|dfStk
rny ddl}|j}|j}||fSttfk
rh|jd}|dfSXYnXdS)zÃDecode *text* with guessed encoding.

    First try UTF-8; this should fail for non-UTF-8 encodings.
    Then try the preferred locale encoding.
    Fall back to latin-1, which always works.
    zutf-8rNÚlatin1)ÚdecodeÚUnicodeDecodeErrorÚlocaleÚgetpreferredencodingÚLookupError)r9rZprefencodingrrr	Úguess_decodes

rcCsDt|ddr<y|j|j}Wntk
r0YnX||jfSt|S)zÊDecode *text* coming from terminal *term*.

    First try the terminal encoding, if given.
    Then try UTF-8.  Then try the preferred locale encoding.
    Fall back to latin-1, which always works.
    ÚencodingN)Úgetattrrrrr)r9Útermrrr	Úguess_decode_from_terminals
rcCs"t|ddr|jSddl}|jS)z7Return our best guess of encoding for the given *term*.rNr)rrrr)rrrrr	Úterminal_encoding)src@seZdZddZdS)ÚUnclosingTextIOWrappercCs|jdS)N)Úflush)r}rrr	Úclose3szUnclosingTextIOWrapper.closeN)rrrrrrrr	r1sr)NF)N)N)N)Fr)%rrKÚiorrLrIÚDOTALLÚ	MULTILINEÚVERBOSErTrMrarVr]r$rr5r
rr"r%r*r4r>rSrZr\r_rcrfrtrzr{rrrrrrrr	Ú<module>	s:





-