V4Sampler Status

From OVISWiki
Revision as of 08:45, 22 March 2018 by Baallan (talk | contribs) (Did, compiled, but cannot test)
Jump to: navigation, search

A number of samplers from v2 and v3 need updating or other change of status in v4 LDMS.

Here we catalog what's happening and any key rationale.

Fixed and tested

  • everything under cray_system_sampler
  • everything under aries_mmr
  • dvs_sampler
  • msr_interlagos
  • meminfo
  • array_example
  • kgnilnd
  • vmstat
  • Procinterrupts - NOTE: This may want to be updated to use arrays
  • procnetdev
  • procnfs
  • Procdiskstats - [runs, may need validation]
  • Procstat
  • Lnet_stats
  • Sampler_atasmart [runs, may need validations]
  • All_example
  • clock
  • fptrans
  • synthetic


Did, compiled, but cannot test

  • procsensors [Ben to longer term investigate configurability of data sources; existing code is dead-machine specific]
BAA: I propose we update procsensors to take a configuration file defining what it should collect.
The unattractive alternative is to put much of the lm_sensors logic into our code base.
The lines of the configuration file will each be a list of key/val pairs parsable with our existing libraries.
Values will include such items as filename of input, filename of label, scaling factor if conversion wanted, etc.
Handling as arrays will be up to the interpretation of the key/value pairs.
Each metric will be read from a file and individually timestamped, due to unknown an potentially wide variance in read times of sensors.
May need to multithread the collection and may need to run less frequently than other samplers.
  • Sysclassib - NOTE: this has warnings about use of a static var in an inline function
  • Llnl/edac.c [Ben to update]

Did, but cannot build

  • Rapl
  • Perfevent – has warning about ioctl

Cannot do (or not applicable). Authors will have to do these. SEE NOTES WITHIN:

  • Cray_power_sampler [NT]
  • Generic_sampler [NT]
  • Hfclock [NT]
  • Job_info
  • Job_info_slurm
  • Ldms_jobid – [BA maintaining]
  • Lustre – [NT]
  • Papi – because I think this is getting replaced
  • Switchx [NT]
  • Test_sampler [NN]
  • Timer_base [NT]
  • Tsampler [NT]
  • Variable [BA update base]

Retired

  • Hadoop - [BA archived MR821]
  • Knc_sampler – [BA archived MR821]
  • Power_sampler [BA archived MR821]