Class GeneXMLUtils
- java.lang.Object
-
- de.julielab.jules.ae.genemapping.resources.ncbigene.GeneXMLUtils
-
public class GeneXMLUtils extends java.lang.Object
-
-
Field Summary
Fields Modifier and Type Field Description static java.lang.StringEG2ENTREZGENE_PROT_FILENAMEstatic java.lang.StringEG2SUMMARY_FILENAMEstatic java.lang.StringTAXIDS_FILENAME
-
Constructor Summary
Constructors Constructor Description GeneXMLUtils()
-
Method Summary
All Methods Static Methods Concrete Methods Modifier and Type Method Description static java.util.Set<java.lang.String>determineMissingTaxIds(java.io.File taxIdFile, java.io.File storageDirectory, java.io.File dbFile, java.io.File downloadedTaxIdsFile)static java.util.List<GeneXmlExtract>extractAndWriteGeneInfoToFile(java.io.OutputStream osSummaries, java.io.OutputStream osProtnames, java.io.OutputStream ofRefSeqAndTracStatus, java.io.InputStream is)static java.util.List<GeneXmlExtract>extractGeneInfoFromXml(java.io.InputStream openStream)static java.util.List<java.io.File>getMetaFiles(java.io.File storageDirectory)Returns the file objects for the meta information files retrieved from NCBI Gene XML data.static voidwriteGeneInfoToFile(java.util.List<GeneXmlExtract> geneExtractList, java.io.OutputStream osSummaries, java.io.OutputStream osProtnames, java.io.OutputStream osRefSeqAndTrackStatus)
-
-
-
Field Detail
-
TAXIDS_FILENAME
public static final java.lang.String TAXIDS_FILENAME
- See Also:
- Constant Field Values
-
EG2ENTREZGENE_PROT_FILENAME
public static final java.lang.String EG2ENTREZGENE_PROT_FILENAME
- See Also:
- Constant Field Values
-
EG2SUMMARY_FILENAME
public static final java.lang.String EG2SUMMARY_FILENAME
- See Also:
- Constant Field Values
-
-
Method Detail
-
writeGeneInfoToFile
public static void writeGeneInfoToFile(java.util.List<GeneXmlExtract> geneExtractList, java.io.OutputStream osSummaries, java.io.OutputStream osProtnames, java.io.OutputStream osRefSeqAndTrackStatus) throws java.io.IOException
- Throws:
java.io.IOException
-
extractGeneInfoFromXml
public static java.util.List<GeneXmlExtract> extractGeneInfoFromXml(java.io.InputStream openStream) throws javax.xml.stream.XMLStreamException, java.io.IOException
- Throws:
javax.xml.stream.XMLStreamExceptionjava.io.IOException
-
extractAndWriteGeneInfoToFile
public static java.util.List<GeneXmlExtract> extractAndWriteGeneInfoToFile(java.io.OutputStream osSummaries, java.io.OutputStream osProtnames, java.io.OutputStream ofRefSeqAndTracStatus, java.io.InputStream is) throws javax.xml.stream.XMLStreamException, java.io.IOException
- Throws:
javax.xml.stream.XMLStreamExceptionjava.io.IOException
-
determineMissingTaxIds
public static java.util.Set<java.lang.String> determineMissingTaxIds(java.io.File taxIdFile, java.io.File storageDirectory, java.io.File dbFile, java.io.File downloadedTaxIdsFile) throws java.io.IOException- Throws:
java.io.IOException
-
getMetaFiles
public static java.util.List<java.io.File> getMetaFiles(java.io.File storageDirectory)
Returns the file objects for the meta information files retrieved from NCBI Gene XML data. The list of files contains- gene summaries file
- gene protein names file
- taxonomy ID list of organisms for which the other files contain records
- gene RefSeq status
- Parameters:
storageDirectory- The base directory where to find/store the meta data.- Returns:
- An ordered list of gene meta data files.
-
-