Single run of TimeXNet from command line

TimeXNet can be run once from command line using this option. The program can be executed as follows:

java –classpath [full installation directory]/timexnet.jar timexnet.NetPath [input network] [initial gene list] [intermediate gene list] [late gene list] [gamma1] [gamma2] [output directory] [glpsol location]

It is necessary to specify the fully qualified path where the jar file is installed in order to run TimeXNet. "glpsol location" is an optional parameter.


Input parameters

The input parameters must be given in the specified order.

  1. Input network: A list of interactions with their directionality and their reliability score in the format:

    "Molecule1[TAB]Molecule2[TAB]Direction[TAB]Reliability"

    • Molecules 1 and 2 can be an Id or a name.
    • The direction of the interaction can be either "--" (bidirectional interaction) or "->" (uni-directional interaction).
    • The reliability score is between 0 and 1.
    • This data is currently provided in the form of a tab-delimited flat file.

    The molecule name should be the same as in the gene lists.

  2. Initial gene list: A list of the genes of interest showing their greatest change in expression in the early hours after stimulation along with their scores in the format:

    "Molecule[TAB]Score"

    • Molecule can be an ID or name. The node identifier used here is the same as that used in the interactions file above.
    • The score can be any positive real number that indicates the amount of change in expression of the gene.
    • This data is also provided in the form of a tab-delimited flat-file.

  3. Intermediate gene list: A list of the genes of interest showing their greatest change in expression in the intermediate hours after stimulation along with their scores in the format specified above.

  4. Late gene list: A list of the genes of interest showing their greatest change in expression in the late hours after stimulation along with their scores in the format specified above. The 3 groups of genes can be made based on their expression patterns over time at the userís discretion.

  5. Gamma1: A positive real value that controls the number of Initial genes included in the predicted network. A larger value results in inclusion of more Initial genes.

  6. Gamma2: A positive real value that controls the number of Intermediate genes included in the predicted network. A larger value results in inclusion of more Intermediate genes.

  7. Output directory: Fully qualified path of the location where the output files should be stored.

  8. GLPSOL location: The fully qualified path to the GNU executable GLPSOL (including the name of the executable) used to solved the optimization problem eg. "c:\\Program Files (x86)\\GnuWin32\\bin\\glpsol.exe". This is an optional parameter and is used only if TimeXNet is not able to find a GLPK installation.


Output files

TimeXNet generates the following output files:

  1. lp_form_g1-g2: The problem formulation in the format required by glpsol

  2. lp_sol_g1-g2: The output file generated by glpsol that contains the solution to the optimization problem

  3. lp_sol_g1-g2.edges: The list of interactions with their flows parsed from the glpsol output file. The format is as follows:

    "Molecule1~Molecule2[TAB]Type[TAB]Flow"

    Here "Type" can be one of "pp" or "pd" indicating a bi-directional or uni-directional interaction, respectively. This file is in a tab-delimited format and can be directly uploaded to Cytoscape to visualize the network.

  4. lp_sol_g1-g2.nodes: The list of nodes in the network in (3) with their associated flows, calculated by adding all the incoming flows per node. The format of this file is as follows:

    "Molecule[TAB]Type[TAB]Flow"

    In this case, Type can be one of SRC (initial genes), INT (intermediate genes), SNK2 (late genes) or NOD (predicted gene showing no change in expression). This file is also in a tab-delimited format and can be uploaded into Cytoscape.

  5. log_g1-g2: Log file showing the detailed progress of the TimeXNet run including the duplicate edges identified and ignored, edges and nodes with erroneous weights and scores, and the detailed output of the glpsol program.

  6. edge_lst_g1-g2: List of edges used to run the final cost flow optimization problem. This file represents the final input network.

  7. Additional filesTimeXNet also generates node and egde attribute files along with a .sif file representing the network that can by uploaded into Cytoscape.
Human Genome Centre, Institute of Medical Science, University of Tokyo