Class EmacsMuleEncoding

    • Constructor Detail

      • EmacsMuleEncoding

        protected EmacsMuleEncoding()
    • Method Detail

      • length

        public int length​(byte[] bytes,
                          int p,
                          int end)
        Description copied from class: Encoding
        Returns character length given stream, character position and stream end returns 1 for singlebyte encodings or performs sanity validations for multibyte ones and returns the character length, missing characters in the stream otherwise
        Specified by:
        length in class Encoding
        Returns:
        0 Never > 0 Valid character, length returned -1 Illegal/malformed character < -1 (-1 - n) Number of missing bytes for character in p...end range Oniguruma equivalent: mbc_enc_len modified for 1.9 purposes,
      • mbcToCode

        public int mbcToCode​(byte[] bytes,
                             int p,
                             int end)
        Description copied from class: Encoding
        Returns code point for a character Oniguruma equivalent: mbc_to_code
        Specified by:
        mbcToCode in class Encoding
      • codeToMbcLength

        public int codeToMbcLength​(int code)
        Description copied from class: Encoding
        Returns character length given a code point Oniguruma equivalent: code_to_mbclen
        Specified by:
        codeToMbcLength in class Encoding
      • codeToMbc

        public int codeToMbc​(int code,
                             byte[] bytes,
                             int p)
        Description copied from class: Encoding
        Extracts code point into it's multibyte representation
        Specified by:
        codeToMbc in class Encoding
        Returns:
        character length for the given code point Oniguruma equivalent: code_to_mbc
      • mbcCaseFold

        public int mbcCaseFold​(int flag,
                               byte[] bytes,
                               IntHolder pp,
                               int end,
                               byte[] lower)
        onigenc_ascii_mbc_case_fold
        Parameters:
        flag - case fold flag
        pp - an IntHolder that points at character head
        lower - a buffer where to extract case folded character Oniguruma equivalent: mbc_case_fold
      • isCodeCType

        public boolean isCodeCType​(int code,
                                   int ctype)
        Description copied from class: Encoding
        Perform a check whether given code is of given character type (e.g. used by isWord(someByte) and similar methods)
        Specified by:
        isCodeCType in class Encoding
        Parameters:
        code - a code point of a character
        ctype - a character type to check against Oniguruma equivalent: is_code_ctype
      • ctypeCodeRange

        public int[] ctypeCodeRange​(int ctype,
                                    IntHolder sbOut)
        Description copied from class: Encoding
        Returns code range for a given character type Oniguruma equivalent: get_ctype_code_range
        Specified by:
        ctypeCodeRange in class Encoding
      • leftAdjustCharHead

        public int leftAdjustCharHead​(byte[] bytes,
                                      int p,
                                      int s,
                                      int end)
        Description copied from class: Encoding
        Seeks the previous character head in a stream Oniguruma equivalent: left_adjust_char_head
        Specified by:
        leftAdjustCharHead in class Encoding
        Parameters:
        bytes - byte stream
        p - position
        s - stop
        end - end
      • isReverseMatchAllowed

        public boolean isReverseMatchAllowed​(byte[] bytes,
                                             int p,
                                             int end)
        Description copied from class: Encoding
        Returns true if it's safe to use reversal Boyer-Moore search fail fast algorithm Oniguruma equivalent: is_allowed_reverse_match
        Specified by:
        isReverseMatchAllowed in class Encoding
      • isCodeCTypeInternal

        protected final boolean isCodeCTypeInternal​(int code,
                                                    int ctype)
        ONIGENC_IS_XXXXXX_CODE_CTYPE
      • isNewLine

        public boolean isNewLine​(byte[] bytes,
                                 int p,
                                 int end)
        onigenc_is_mbc_newline_0x0a / used also by multibyte encodings
        Specified by:
        isNewLine in class Encoding
      • asciiMbcCaseFold

        protected final int asciiMbcCaseFold​(int flag,
                                             byte[] bytes,
                                             IntHolder pp,
                                             int end,
                                             byte[] lower)
      • applyAllCaseFold

        public void applyAllCaseFold​(int flag,
                                     ApplyAllCaseFoldFunction fun,
                                     Object arg)
        onigenc_ascii_apply_all_case_fold / used also by multibyte encodings
        Specified by:
        applyAllCaseFold in class Encoding
        Parameters:
        flag - case fold flag
        fun - case folding functor (look at: ApplyCaseFold)
        arg - case folding functor argument (look at: ApplyCaseFoldArg) Oniguruma equivalent: apply_all_case_fold
      • asciiCaseFoldCodesByString

        protected final CaseFoldCodeItem[] asciiCaseFoldCodesByString​(int flag,
                                                                      byte[] bytes,
                                                                      int p,
                                                                      int end)
      • caseFoldCodesByString

        public CaseFoldCodeItem[] caseFoldCodesByString​(int flag,
                                                        byte[] bytes,
                                                        int p,
                                                        int end)
        onigenc_ascii_get_case_fold_codes_by_str / used also by multibyte encodings
        Specified by:
        caseFoldCodesByString in class Encoding
      • propertyNameToCType

        public int propertyNameToCType​(byte[] bytes,
                                       int p,
                                       int end)
        onigenc_minimum_property_name_to_ctype notably overridden by unicode encodings
        Specified by:
        propertyNameToCType in class Encoding