Tokenization

From Working EDRM

Jump to: navigation, search
Comments: Please submit comments to the EDRM Glossary forum
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 1 2 3 4 5 6 7 8 9

An operation that examines a document or block of text and breaks the text into words. Typically, a space is used to separate words, but special characters such as a hyphen, period, or quotation mark can also be used.[1]

Footnotes

  1. ^  EDRM Search Glossary
Personal tools
additional information