|
.NET Framework | |||||||||
PREV TYPE NEXT TYPE | FRAMES NO FRAMES | |||||||||
SUMMARY: INNER | FIELD | CONST | PROP | METHOD | OP | EVENT | DETAIL: FIELD | CONST | PROP | METHOD | OP | EVENT |
Distills metadata or a string for uniform indexing and querying, e.g changing 'You had a great time on that horse' to 'great time horse'
Field Summary | |
---|---|
System.Globalization.CultureInfo | culture |
double |
FILE_WEIGHT
If percentage of stopwords is under this amount, take them out |
double |
META_WEIGHT
If percentage of stopwords is under this amount, take them out |
string[] | stopWords |
Constructor Summary | |
---|---|
Distiller() Initializes a new instance of the Distiller class. |
Method Summary | |
---|---|
string[] |
DistillFilename(string filename) Distills a filename into a set of words and hashes them. |
string |
DistillMeta(string key, string val) Distills a metadata value by taking out non-critical information, and hashes key:val |
string[] |
MarkAndSplit(string source, int[] marks, int marked) Splits up source text into words and marks words that are stop words or non-alphabetical text. |
string[] |
SplitWords(string words) Splits a text string into words |
Methods inherited from class System.Object |
---|
Equals, Finalize, GetHashCode, GetType, MemberwiseClone, ToString |
Field Detail |
---|
private System.Globalization.CultureInfo culture
private double FILE_WEIGHT
If percentage of stopwords is under this amount, take them out
private double META_WEIGHT
If percentage of stopwords is under this amount, take them out
private string[] stopWords
Constructor Detail |
---|
private Distiller()
Initializes a new instance of the Distiller class.
Method Detail |
---|
public string[] DistillFilename(string filename)
Distills a filename into a set of words and hashes them.
Example: Morcheeba - Big Calm: 01 The Sea.mp3 Would get converted into {morcheeba, big, calm, sea}
Parameters:filename
- Filenamepublic string DistillMeta(string key, string val)
Distills a metadata value by taking out non-critical information, and hashes key:val
Parameters:
val
- Valuekey
- Keyprivate string[] MarkAndSplit(string source, int[] marks, int marked)
Splits up source text into words and marks words that are stop words or non-alphabetical text.
Parameters:
source
- Source textmarks
- Marks (0 is a word, 1 is a stop word, 2 is punctuationmarked
- How many marks where madeprivate string[] SplitWords(string words)
Splits a text string into words
Parameters:
words
- Text
|
.NET Framework | |||||||||
PREV TYPE NEXT TYPE | FRAMES NO FRAMES | |||||||||
SUMMARY: INNER | FIELD | CONST | PROP | METHOD | OP | EVENT | DETAIL: FIELD | CONST | PROP | METHOD | OP | EVENT |