
Similarity Index

Updated over 3 years ago

The Similarity Index reflects the statistical and semantic similarity between documents. It ranges from a minimum of 0 to a maximum of 1000. Regardless of the underlying technology, patents with a similarity below 100 are not returned. A value of 1000 means the texts are not only semantically but also lexically identical. Because of the complexity of the underlying algorithms, the index cannot be interpreted in a strictly linear manner (e.g. a patent pair with a value of 400 is not a third more similar than a patent pair with a value of 300).
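As a rough illustration of how these rules might be applied to a result list, here is a minimal Python sketch. It assumes the index is an integer on a 0 to 1000 scale (reading "1.000" as one thousand, in line with the values 100, 300 and 400 mentioned above). The function name `rank_hits`, the document IDs and the scores are all hypothetical and are not part of any actual product API.

```python
# Assumption: the index runs from 0 to 1000; results below 100 are not returned.
MIN_RETURNED = 100
MAX_SIMILARITY = 1000

def rank_hits(hits):
    """Drop hits below the threshold and order the rest, most similar first."""
    kept = [(doc_id, score) for doc_id, score in hits if score >= MIN_RETURNED]
    return sorted(kept, key=lambda pair: pair[1], reverse=True)

# Hypothetical document IDs and scores, for illustration only.
hits = [("DOC-A", 300), ("DOC-B", 1000), ("DOC-C", 85), ("DOC-D", 400)]
ranked = rank_hits(hits)
print(ranked)
```

The sketch only compares scores by order. Because the index is not linear, it is safer to treat it as a ranking signal (DOC-D is more similar than DOC-A) than to compute ratios or differences between values.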
