Welcome! Thank you for using TIMEOR.

Quick Start: 3 Steps

TIMEOR accepts 2 input types: (1) raw .fastq files and SraRunTable (e.g. here) or a (2) RNA-seq time-series read count matrix (e.g. here) and metadata file (e.g. here).

Visit https://timeor.brown.edu.
For (1) in ‘Example Data’ (side-bar) under ‘Load raw data’ click the ‘SraRunTable & .fastq files’ button. This will guide you through the ‘Set Input and Defaults, Process Raw Data’ tab demo. Follow pop-ups and fill in grey boxes. See Run TIMEOR for walk-through.
Next, for (2) in ‘Example Data’ (side-bar) under ‘Load count matrix’ click the ‘Metadata & read count file’ button. This will guide you through the rest of the full method demo. Follow pop-ups and fill in grey boxes. See Run TIMEOR for full application walk-through.

Important Points to Remember

We strongly encourage the user to input a read count matrix (and associated metadata file) when possible, as the input file size limit is 10GB.
For larger dataset processing, the user is encouraged to use our ready-to-use Docker image. Read 4 steps here, in Tutorials (left side-bar), ‘Web Server’ tab, ‘Local Installation’ section. If that is not possible, please feel free to contact us for specific space requirements. We are happy to help.
While TIMEOR analysis is running, simply make sure to revisit the page at least once an hour.
We strongly encourage the user to keep ‘5.Compare multiple methods’ set to ‘Yes’ to see TIMEOR’s full functionality.
Please click each button just once.
Once analysis has begun, please proceed through TIMEOR sequentially. The user can visit previous tabs, but proceed forward sequentially. Before beginning the analysis, the user can skim through each tab to see what is to come.
The user can download the demo data and go through the tutorial to get a sense for how long the analysis will take given user interface interaction (such as choosing a certain number of clusters besides 3).
TIMEOR supports these types of time-series data (note this is asked in Question 3 of “Determine Adaptive Default Methods”):
- control at 1st time point vs case (i.e. treatment) at subsequent time points
- control or case over time
Please compare one set of time-series experiments at a time.
- TIMEOR v1 does not support time-series RNA-seq of these types:
  - control v.s. case 1 vs. case 2
- In this case, simply perform the analysis separately:
  - control v.s. case 1
  - control v.s. case 2
NOTE Importantly, we assume that replicate batches are sampled with each batch sampled at each time point. That means batch 1 across 4 timepoints would have corresponding replicate 1 at time point 1, replicate 1 at time point 2, replicate 1 at time point 3 and replicate 1 at time point 4. This process continues for all batches. This structure is adopted to work with all three differential expression methods. Moreover, it is a common structure to control for non-biological variation in a time-series experiment. For example, say RNA-seq was performed on a cell line after insulin stimulation, on 10 consecutive time points, every 20 minutes, with three biological replicates (such as in our manuscript). This means that non-biological factors could be considered when determining temporal differential expression by being able to compare three biological replicates!
Some time-series experimental designs are complex. In those cases, and it is advised to reach out to us with any questions before beginning the analysis. We are very willing to help, and are responsive!
It is advised to skip past the ‘Enrichment’ tab if time is limited, as programs such as MEME can take a long time, even though we limited the motif size to maximum 20 basepairs. The user can certainly go back to the ‘Enrichment’ tab once the rest of the analysis is complete.
It is advised to wait until any running process is finished before downloading results or logs to ensure a successful download.
Thank you for using TIMEOR! Please help us improve to better assist you. Please contact us with questions, ideas, and suggestions. If an error occurs with your data, please download the log file (far left) to check. When contacting us with questions, please send the time, the log file, and if possible a screenshot so we know where in TIMEOR you are.

Two ways to input data:

Import SraRunTable from GEO* where TIMEOR will process raw data through retrieving .fastq files, quality control, alignment, and read count matrix creation. Read first tab of TIMEOR (Getting Started) for information about this input specification. Read this section for information about how to process these data in TIMEOR. We strongly encourage users to upload a read count matrix, or process raw .fastq data via TIMEOR’s interface locally using Docker (see 4 steps here) in Tutorials (left side-bar), ‘Web Server’ tab, ‘Local Installation’ section.
Import metadata file** and count matrix *** (skipping raw data retrieval, quality control, alignment, and read count matrix creation) and proceeding straight to normalization and correction. Read this section for information about how to process these data in TIMEOR.

Then simply follow the prompts. Fill out the grey boxes to begin interacting with each stage and tab.

Input file types:

  NOTE: see first tab of TIMEOR called Getting Started for specifications.

* SraRunTable from GEO requires at least these columns (which will be reordered to produce the metadata file).
- treatment, time, Run, replicate, batch
  - treatment: one word describing experiment
  - time: numerical values e.g. (0, 20, 40)
  - replicate: one integer description of replicate (e.g. 1, 2, 3) (could have same information as batch)
  - batch: one integer description of batch (e.g. 1, 2, 3)

** metadata file requires at least these columns.

ID, condition, time, batch
- ID: a unique identifier (ID) for the user (e.g. case1min_rep1)
- condition: one word description (e.g. case, control)
- time: numerical values e.g. (0, 20, 40)
- batch: one integer description of batch (e.g. 1, 2, 3)

An example might be:

  ID    batch condition   time
  simT0.1 1   control   0
  simT0.2 2   control   0
  simT0.3 3   control   0
  simT1.1 1   case      1
  simT1.2 2   case      1
  simT1.3 3   case      1
  simT2.1 1   case      2
  simT2.2 2   case      2
  simT2.3 3   case      2
  simT3.1 1   case      3
  simT3.2 2   case      3
  simT3.3 3   case      3

*** count matrix requires Ensembl or Flybase unique gene identifiers, and columns should be the IDs from metadata file, and in the same order as metadata file.

Inputs Detailts: SraRunTable

Please upload the SraRunTable.txt, which has comma delimiters.
Please remove as many unneeded colums as you can. Some dataset SraRunTables have odd delimiters that are difficult to parse.
Make sure that the resulting metadata file meets the requirements for the input metadata file (link here).
For paired-end reads, the pairs must have "_NUM" to distinguish them (e.g. SRRXXX_1.fastq.gz, SRRXXX_2.fastq.gz).

Input Data: .fastq Files

The .fastq files are downloaded as .fastq.gz.
Data should at least have two replicates.

Input Details: Metadata File

Make sure there are an equal number of replicates for each sample.
Label control as “control”.
Have time point data in order where control or time point 1 is at the top and the last time point is at the bottom.
Have unique IDs that ideally follow one of these formats:
- NAMETIME.BATCH (e.g.)
- NAMETIMEREPLICATE.BATCH
- NAMETIMEBATCH.REPLICATE
Sometimes when using editors such as Excel, odd delimiters specific to the user’s machine are added at the end of lines. We advise users to check that these are not present.
Please upload .csv files.

Input Data: Read Count Matrix

Pre-filter out any rows you are not interested to process (such as low count genes across all samples).
Gene ID column should be named “ID” and populated with Ensembl IDs.
Make sure columns are in the same order as the rows of metadata file.
Please upload .csv files.
Data should at least have two replicates.

Suggestions for How to Answer Six “Determine Adaptive Default Methods” Questions

Overall the user must select at least the organism, sequencing, and experiment type, then load metadata or SraRunTable.txt.
Question 1 asks: “What type of organism?” The user can choose from fruit fly, human, or mouse.
Question 2 asks: “What type of sequencing?” If the user is uploading a read count matrix, strongly encouraged, the user can choose “not applicable”.
Question 3 asks: “What type of experiment?” There are two options - “case vs. control”, and “just case or control” types of time-series that TIMEOR supports (see this section).
Question 4 asks: “What type of time-series?” There are three options - “close time point and long time series”, “close time point and short time series”, and “distant time point”. Based on the user’s understanding of the biological system, the user should decide whether the timepoints are considered close or far in time. This question is important to determine how to model differential gene expression (DE) trajectories over time.
- DESeq2 is a categorical DE method generally used to analyze timepoints separately. When time points are far apart this is a good option. TIMEOR uses DESeq2 if the user toggles to “distant time point”.
- When we are interested to model gene trajectories (when time points are close), we assess the temporal dynamic expression between time point $t$ given $t-1$. In this context, it is advised a continuous DE method. TIMEOR uses ImpulseDE2 if the user toggles to “close time point …”. ImpulseDE2 employs an impulse model to determine differentally expressed genes.
- Importantly, it is strongly advised to compare all three (ImpulseDE2, Next maSigPro, and DESeq2) DE methods’ results (by keeping ‘Yes’ for Question 5), especially when there are “close time points and short time-series”. Recent studies such as Spies et al. 2019 show that DESeq2 performs well when determining differentially expressed genes when time-series is short. To compare all three these, keep ‘Yes’ as the answer for Question 5 (below).
- Please see our manuscript for more a more robust explanation and a series of citations for further reading.
Question 5 asks: “Compare multiple methods (alignment and differential expression)?” If this question is left to ‘Yes’ (which is strongly encouraged), TIMEOR will run all methods for the user to determine the best suited method. This is important because in many cases the categorical method DESeq2 which does not consider gene trajectories, still returns a robust set of differentially expressed genes. If this is set to ‘No’, TIMEOR will run for alignment (if applicable): HISAT2, and for DE: DESeq2 (if distant time points selected in Question 4), or ImpulseDE2 (if close time points selected in Question 4).
Question 6 asks: “What is the maximum number of time steps over which one gene can influence the transcription of another gene?” This question prompts the user to tell TIMEOR the window of time over which one gene can directly influence another. Within this window all interactions are considered. It is advised to keep this value small if the time points are spaced out. Said differently, at each time point $t$ for a differentially expressed gene $g$, if Question 6’s answer were 2, TIMEOR would be asking, what are potential interactions of $g$ with other TFs across $t+1$ and $t+2$.

Method and question choice assistance

“Normalize and Correct” tab: there are two normalization options - upper quartile and trimmed mean of M-values. It is advised to try both methods through TIMEOR’s interactive interface because the influence of normalization differs depending on the RNA-seq data structure.
- There are several recent papers that discuss these differences such as Zyprych-Walczak et al. 2015, Pereira et al. 2018, and Abbas-Aghababazadeh et al. 2018.
“Normalize and Correct” tab: there are two options for correlating samples/replicates using the Pearson or Spearman correlation. The choice of correlation method depends heavily on the assumptions the user wants to make about their data, and it is encouraged to try both in TIMEOR’s interface. The user knows more about which samples/replicates (e.g. time points) should cluster together and how to identify outliers.
- Both correlation methods define the strength of the relationship between the samples/replicates. The Pearson correlation accounts for differences in the samples/replicates mean and standard deviation when defining the linear relationship. The Spearman correlation is actually a nonparametric measure that uses the rank values of the samples/replicates.
- Importantly, the more similar the expression profiles between samples/replicates, the higher the correlation coefficient will be.
- Furthermore, the user is encouraged to remove any outliers (if needed) for further analysis.
“Primary Analysis” stage: the user can choose to allow TIMEOR to automatically cluster the DE gene trajectories, or the user can choose the number of gene trajectory clusters. Importantly, finding the optimal solution to this hierarchical clustering problem is an NP-hard. Thus, user input is needed to assess a reasonable number of clusters for downstream analysis. To help, TIMEOR provides an automatic clustering option (PDF visible when folder is downloaded) which takes the mode between three unsupervised clustering methods (partition around medoids (Reynolds et al. 2006), Silhouette (Rousseeuw et al. 1987), and Calinski criterion (Calinkski et al. 1974)) to automatically return the number of gene trajectory clusters to the user. TIMEOR also provides an Elbow plot to show the user how the explained variation changes as a function of the number of clusters. The user can leverage this plot by picking the elbow of the curve. The user is encouraged to use the interactive clustermap and the clustering plots (available on download) to determine whether the automatic clustering option provides suitable clusters.
“Primary Analysis” stage: NOTE, there is not a fold change cut-off for the DE gene trajectories, only an adjusted p-value cutoff. This allows the user to view significant differences in expression trajectories while the fold change might be smaller. This is useful to observe changes for genes including non-coding genes and genes involved in dosage compensation.
“Secondary Analysis: Factor Binding”: the user is encouraged to “see each method’s predicted transcription factors” and search for protein-DNA data (in .bigWig format) to view the binding profile of that transcription factor across each gene trajectory cluster.
“Secondary Analysis: Temporal Relations”: the user can add additional genes or transcription factors (potentially viewed on Factor Binding tab) to the final gene regulatory network (GRN) within STRINGdb. NOTE: TIMEOR only reports the TF GRN using the observed and top one predicted TFs from the “Observed and Top Predicted Transcription Factors” table. The user is encouraged to view the results from individual methods (on Factor Binding tab) when constructing the final GRN, and view Temporal Relations Table to uncover the lead and lag relationships between TFs.

Local Installation

To run TIMEOR outside of website (recommended for preprocessing from raw .fastq files), users may use Docker and Docker Hub. First, the TIMEOR repository must be cloned (https://github.com/ashleymaeconard/TIMEOR.git). To use Docker, it must be installed (version 20.10.0 recommended).

Docker Hub and Docker:

Download contents of organism genome folder (/genomes_info/) into desired location (e.g. /Users/USERNAME/Desktop/test_folder/genomes_info/) to mount later.
- The user is welcome to gather only the organism of interest. For example, for Drosophila melanogaster simply download /genomes_info/dme/
  - Mouse is /genomes_info/mmu/
  - Human is /genomes_info/hsa/
- Link /genomes_info/: https://drive.google.com/drive/folders/1KEnpCOU0dQU5p1tnEy3o9l02NE0uYnpm?usp=sharing
Make sure contents of /genome_info/ are readable.
- For example if using Drosophila melanogaster, in a console type chmod -R 777 /Users/USERNAME/Desktop/test_folder/genomes_info/dme/.
Run TIMEOR via Docker
- On command line type
  - $ docker pull ashleymaeconard/timeor:latest
  - $ docker images
  - $ docker run -v /Users/USERNAME/Desktop/test_folder/:/srv/ -p 3838:3838 <IMAGE_ID>
Open TIMEOR Application is available by typing:
- Shiny server will be running on port 3838. Thus, in a browser visit localhost:3838.

Or, build Docker image

NOTE: This could take a while. Please follow these commands:

$ cd /PATH/TO/TIMEOR/
Build Docker image in TIMEOR directory:
- $ docker build -t timeor_env .
Follow instructions 3 and 4 above.
In another command line window
- $ docker container ls
- $ docker exec -it <CONTAINER_NAME> /bin/bash/
Now you have a console within Docker to run commands.