Format:
Case SEQUENCE: NumTerms(VInt32) | <TermID(VInt32)> NumTerms
Case FREQ: NumTerms(VInt32) | NumDistinct(VInt32) | <TermID(VInt32) | TermFreqs(VInt32)> NumDistinct
Case POSITION: NumTerms(VInt32) | NumDistinct(VInt32) | <TermID(VInt32) | TermFreqs(VInt32) | <Position(VInt32)> TermFreqs> NumDistinct
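Notes:
- NumTerms: total number of terms
- TermID: term ID
- NumDistinct: number of distinct terms
- TermFreqs: term frequency
- Position: term position, delta-encoded and variable-length compressed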
|
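As a rough illustration of the POSITION case, the sketch below serializes a document's term sequence, reading `<...> N` as N repetitions of the bracketed group. Several details are assumptions and not stated in the format description: VInt32 is taken to be the common layout of 7 payload bits per byte, low-order group first, with a continuation bit; distinct terms are written in ascending TermID order; and each Position is stored as the difference from the previous position of the same term, with the first stored relative to 0. The function names `encode_vint32` and `encode_position_case` are hypothetical.

```python
from collections import defaultdict


def encode_vint32(value):
    """Encode an unsigned 32-bit integer as a variable-length integer.

    Assumed layout: 7 payload bits per byte, low-order group first,
    high bit set on every byte except the last.
    """
    if not 0 <= value < 2 ** 32:
        raise ValueError("value out of range for VInt32")
    out = bytearray()
    while value >= 0x80:
        out.append((value & 0x7F) | 0x80)  # more bytes follow
        value >>= 7
    out.append(value)  # final byte, continuation bit clear
    return bytes(out)


def encode_position_case(term_ids):
    """Serialize a sequence of term IDs using the POSITION layout."""
    # Collect the positions at which each distinct term occurs.
    positions = defaultdict(list)
    for pos, term_id in enumerate(term_ids):
        positions[term_id].append(pos)

    out = bytearray()
    out += encode_vint32(len(term_ids))    # NumTerms
    out += encode_vint32(len(positions))   # NumDistinct
    for term_id in sorted(positions):      # assumed ordering: ascending TermID
        plist = positions[term_id]
        out += encode_vint32(term_id)      # TermID
        out += encode_vint32(len(plist))   # TermFreqs
        prev = 0
        for pos in plist:
            out += encode_vint32(pos - prev)  # Position as a delta
            prev = pos
    return bytes(out)
```

For example, under these assumptions the sequence `[7, 3, 7, 7]` yields NumTerms 4 and NumDistinct 2, followed by term 3 (frequency 1, delta 1) and term 7 (frequency 3, deltas 0, 2, 1). The FREQ case would be the same loop without the inner position deltas, and the SEQUENCE case would write NumTerms followed by each TermID in document order.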