Output Files

ShinyLearner produces output files that follow the tidy data principles. This affords extra flexibility and consistency when interpreting the results. These tab-delimited text files can be imported directly into third-party analytic tools, such as Microsoft Excel, R, or Python. The descriptions below indicate what each output file contains. The UserScripts documentation indicates which output files are produced for each type of analysis.
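As a rough illustration of importing one of these files into Python, the sketch below reads a tab-delimited file with a header row using only the standard library. The helper name `read_tsv` is hypothetical, and the sketch makes no assumption about which columns a given output file contains; it simply returns whatever the header row declares.

```python
import csv
from pathlib import Path


def read_tsv(path):
    """Read a tab-delimited file with a header row into a list of dicts.

    Hypothetical helper, not part of ShinyLearner. Every value is kept as a
    string; convert numeric columns yourself once you know the layout.
    """
    with Path(path).open(newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle, delimiter="\t"))
```

For example, `rows = read_tsv("Metrics.tsv")` would give one dictionary per row, keyed by column name, assuming the file sits in the current working directory.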

Metrics.tsv

Predictions.tsv

  • Contains individual-level predictions for each sample/instance for which predictions were made.
  • Example

SelectedFeatures.tsv

  • A ranked list of features for each feature-selection algorithm.
  • Example
  • Note: If multiple data files were used as input, each feature name is prefixed with the path to its source file.

ElapsedTime.tsv

  • Indicates how long it took for each algorithm to execute.
  • Example

When nested cross-validation is used to optimize hyperparameters or to select features, ShinyLearner produces output files that describe algorithm performance within the training sets. The following descriptions indicate what these files contain.

Nested_Metrics.tsv

Nested_Best.tsv

  • Indicates which hyperparameter combination performed best for each classification algorithm.

Nested_Predictions.tsv

  • Contains individual-level predictions for each sample/instance for which predictions were made for each algorithm.
  • Example

Nested_SelectedFeatures.tsv

  • A ranked list of features for each feature-selection algorithm.
  • Example
  • Note: If multiple data files were used as input, each feature name is prefixed with the path to its source file.

Nested_SelectedFeatures_Summarized.tsv

  • A ranked list of features, averaged (using Borda count) across all iterations, for each feature-selection algorithm.
  • Example
  • Note: If multiple data files were used as input, each feature name is prefixed with the path to its source file.
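To give a sense of how a Borda count can combine rankings across iterations, here is a minimal sketch. It is a generic textbook version and not necessarily how ShinyLearner computes the summary: the function name `borda_summarize`, the scoring rule (a feature at 0-based position `p` in a list of `n` features earns `n - p` points), the alphabetical tie-breaking, and the feature names are all assumptions made for illustration.

```python
from collections import defaultdict


def borda_summarize(rankings):
    """Combine ranked feature lists (best first) using a Borda count.

    Hypothetical helper for illustration. Each ranking awards n - position
    points to a feature (n = list length, position is 0-based). Points are
    averaged over all rankings, and features are returned from highest to
    lowest average score. Ties are broken alphabetically.
    """
    totals = defaultdict(float)
    for ranking in rankings:
        n = len(ranking)
        for position, feature in enumerate(ranking):
            totals[feature] += n - position
    averages = {feature: total / len(rankings) for feature, total in totals.items()}
    return sorted(averages, key=lambda feature: (-averages[feature], feature))


# Made-up rankings from three iterations (feature names are illustrative).
iterations = [
    ["geneA", "geneB", "geneC"],
    ["geneB", "geneA", "geneC"],
    ["geneA", "geneC", "geneB"],
]
print(borda_summarize(iterations))  # ['geneA', 'geneB', 'geneC']
```

In this made-up example, geneA collects 8 points, geneB 6, and geneC 4 across the three iterations, so the summarized ranking places geneA first.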

Nested_Classification_ElapsedTime.tsv

  • Indicates how long it took for each classification algorithm to execute.
  • Example

Nested_FeatureSelection_ElapsedTime.tsv

  • Indicates how long it took for each feature-selection algorithm to execute.
  • Example

Log.txt

  • Contains the text that was printed to standard output.
  • Example