|
|||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object de.zimek.proteinfeatures.ArffFileCreator
Main class for creating protein features in arff format. The features are created according to the feature extracting classes specified in a given property file. The property file can specify following properties:
SCOPLEVEL CATHLEVEL ATTRIBUTES MINSUPPORT MVALUES AAINDICES MOTIF_MAPSCOPLEVEL can get assigned either one of CLASS, FOLD, SUPERFAMILY, or FAMILY.
de.zimek.proteinfeatures.attributeAssigner.ArffAttributeAssigner
s.de.zimek.proteinfeatures.attributeAssigner.AutoCorrelationFunction
s in the specified list of de.zimek.proteinfeatures.attributeAssigner.ArffAttributeAssigner
s (same order).de.zimek.proteinfeatures.aaindex.AAIndex
for all AutoCorrelationFunction in the specified list of de.zimek.proteinfeatures.attributeAssigner.ArffAttributeAssigner
s (same order).de.zimek.proteinfeatures.attributeAssigner.MotifAssigner
s in the specified list of de.zimek.proteinfeatures.attributeAssigner.ArffAttributeAssigner
s (same order).
Field Summary | |
static java.lang.String |
AAINDEX_PACKAGE
The name of the package for aaindex assigners. |
private ArffAttributeAssigner[] |
arffAttributeAssigner
The ArffAttributeAssigners that are to create features. |
static java.lang.String |
ATTRIBUTE_PROPERTY
The property-name for attributes. |
private ID[] |
coveredClasses
THe ids of covered classes. |
static java.lang.String |
DEFAULT_PACKAGE
The default package to search for specified classes. |
static java.lang.String |
DEFAULT_PARSER_PACKAGE
The default parser package. |
private int |
level
The level of hierarchy. |
private int |
minSupport
Default minimum support. |
static java.lang.String |
MOTIF_PROPERTY
The property-name for a motif map. |
private java.lang.String |
relation
Name of the relation. |
static java.lang.String[] |
SUB_PACKAGES
The subpackages for group descriptors. |
Constructor Summary | |
ArffFileCreator(java.util.Properties properties)
Initializes the ArffFileCreator according to the settings in the given properties. |
Method Summary | |
protected java.lang.String |
concatenate(java.lang.String[] subclasses)
Returns a String describing a set of the given array of class-names. |
protected Protein[] |
filterClasses(Protein[] proteins)
Returns only the proteins having at least minimumSupport. |
protected java.lang.String |
getCoveredClassesArff()
Returns the covered classes as arff-formatted Attribute-String. |
protected java.lang.String |
getCoveredHierarchy()
Returns the covered hierarchy as a single String. |
java.lang.String |
getDataLine(Protein protein)
Returns a String providing the data-entry-line in arff-format for the given protein, at last the class according to the current set class-level in scop- or cath-notation and a newline. |
static void |
main(java.lang.String[] args)
Main routine to create an arff file providing the features as specified in a given property file. |
void |
printArffFile(Protein[] proteins,
java.io.PrintStream out)
Prints an arff file for the given proteins according to the current settings to the given PrintStream. |
void |
setCoveredClasses(Protein[] proteins)
Sets the covered classes according to the classes occuring in the given array of proteins. |
void |
setMinSupport(int minSupport)
Sets the minSupport. |
protected java.lang.String |
subclasses(ID[] classes,
int level)
Returns a String describing the set of subclasses splitted at the given level. |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
public static final java.lang.String ATTRIBUTE_PROPERTY
public static final java.lang.String MOTIF_PROPERTY
public static final java.lang.String DEFAULT_PACKAGE
public static final java.lang.String[] SUB_PACKAGES
public static final java.lang.String DEFAULT_PARSER_PACKAGE
public static final java.lang.String AAINDEX_PACKAGE
private ArffAttributeAssigner[] arffAttributeAssigner
private int level
private ID[] coveredClasses
private java.lang.String relation
private int minSupport
Constructor Detail |
public ArffFileCreator(java.util.Properties properties) throws java.lang.IllegalArgumentException
properties
- properties providing the properties for ArffFileCreator
java.lang.IllegalArgumentException
Method Detail |
public void printArffFile(Protein[] proteins, java.io.PrintStream out)
proteins
- the proteins to print an arff file forout
- the PrintStream where to printpublic void setCoveredClasses(Protein[] proteins)
proteins
- the proteins whose classes are to be coveredprotected Protein[] filterClasses(Protein[] proteins)
proteins
-
public void setMinSupport(int minSupport)
minSupport
- The minSupport to setprotected java.lang.String getCoveredHierarchy()
protected java.lang.String subclasses(ID[] classes, int level)
classes
- the classes to be described as hierarchylevel
- the desired depth of the hierarchy
protected java.lang.String concatenate(java.lang.String[] subclasses)
subclasses
- the class-names to be collected in a set
protected java.lang.String getCoveredClassesArff()
public java.lang.String getDataLine(Protein protein)
protein
- the protein to create the feature entry for
public static void main(java.lang.String[] args) throws java.lang.Exception
args
-
java.lang.Exception
|
|||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |