Package version:

Interface CjkBigramTokenFilter

Forms bigrams of CJK terms that are generated from the standard tokenizer. This token filter is implemented using Apache Lucene.

interface CjkBigramTokenFilter {
    ignoreScripts?: CjkBigramTokenFilterScripts[];
    name: string;
    odatatype: "#Microsoft.Azure.Search.CjkBigramTokenFilter";
    outputUnigrams?: boolean;
}

Hierarchy (view full)

Properties

ignoreScripts?: CjkBigramTokenFilterScripts[]

The scripts to ignore.

name: string

The name of the token filter. It must only contain letters, digits, spaces, dashes or underscores, can only start and end with alphanumeric characters, and is limited to 128 characters.

odatatype

Polymorphic discriminator, which specifies the different types this object can be

outputUnigrams?: boolean

A value indicating whether to output both unigrams and bigrams (if true), or just bigrams (if false). Default is false.