Commit 97005ad6 authored by Anna Wyszomirska's avatar Anna Wyszomirska
Browse files

new genome data added

parent 370d53fd
Genome is a Spark application related to the analysis of genome data within a scientific biomedical context. This deployment was run on instances with 2 cores. Reconfiguration was not allowed, min number of machines was 1, maximum 1. The application was running for about 6 hours.​ Each metric was a count in the 30-second window.
The following files contain relevant information:
- EstimatedRemainingTime Context.csv - column 'value' informs about estimated remaining time to the end of the simulation
- MinimumCoresContext.csv - column 'value' informs about minimum cores required to finish simulation on time
- NotFinishedOnTimeContext.csv
- RemainingSimulationTimeMetric.csv - column 'value' informs about how much time is left to the end of the simulation
- SimulationElapsedTime.csv - column 'value' inform about how much time last from the beginning of the simulation
- SimulationLeftNumber.csv - column 'value' informs about how many simulations were finished
- WillFinishTooSoonContext.csv
- nodeTable.csv - column 'totalCount' has information about how many instances are currently running
Entry data set for this deployment: https://s3-eu-west-1.amazonaws.com/melodic.testing.data/mdfs/VERY_BIG_data.csv
Utility Function:
//utility function
variable Utility{
template MetricTemplateCamelModel.MetricTemplateModel.UtilityTemplate
formula: ('1/(WorkerCardinality * WorkerPrice)')
}
Markdown is supported
0% or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment