Changes for page The FLASH HDF5 structure
Last modified by sndueste on 2025/02/06 10:55
Summary
-
Page properties (1 modified, 0 added, 0 removed)
Details
- Page properties
-
- Content
-
... ... @@ -1,6 +4,3 @@ 1 -{{layout}} 2 -{{layout-section ac:type="single"}} 3 -{{layout-cell}} 4 4 == Contents == 5 5 6 6 ... ... @@ -19,18 +19,14 @@ 19 19 {{code language="none"}}> module load xray{{/code}} 20 20 {{code language="none"}}> hdfview{{/code}} 21 21 22 -o r youcanuse19 +[[Contents>>doc:||anchor="Contents"]] 23 23 24 - {{codelanguage="none"}}>silxview{{/code}}21 +== The new (starting 2021) HDF5 format == 25 25 26 26 \\ 27 27 28 -[[Contents >>doc:||anchor="Contents"]]25 +FLASH provides a conversion of its data acquisition (DAQ) to the commonly used [[HDF5>>url:https://www.hdfgroup.org/solutions/hdf5/||shape="rect"]] format. Correlated data are mapped by a primary index called **train ID**. Every data set has an individual index of train IDs to identify the data even when data are missing or deviate in update rate.If the data set of choice contains gaps, users have to decide how to treat missing values. DAQ channels are spread across various files with one file name pattern for each DAQ. This means users have to assemble data from different files if necessary. 29 29 30 -== The FLASH HDF5 format == 31 - 32 -FLASH provides a conversion of its data acquisition (DAQ) to the commonly used [[HDF5>>url:https://www.hdfgroup.org/solutions/hdf5/||shape="rect"]] format. Correlated data are mapped by a primary index called **train ID**. Every data set has an individual index of train IDs to identify the data even when data are missing or deviate in update rate. If the data set of choice contains gaps, users have to decide how to treat missing values. DAQ channels are spread across various files with one file name pattern for each DAQ. This means users have to assemble data from different files if necessary. 33 - 34 34 The hierarchy is realized using a human readable named HDF tree with each DAQ channel containing the data sets "//value//" and "//index//". Additionally, the HDF group //zraw// contains a tree of the original DAQ channel names. 35 35 36 36 Reference implementation, which follows the concept of Python libraries like Pandas, Xarray, or Dask, is given below. ... ... @@ -38,7 +38,7 @@ 38 38 \\ 39 39 40 40 {{expand title="Discontinued HDF formats"}} 41 -== =Comparison to FLASH's deprecated HDF formats ===34 +== Comparison to FLASH's deprecated HDF formats == 42 42 43 43 Before 2021, FLASH provided two different HDF formats formally known as //near-online// and //offline// HDF files. 44 44 ... ... @@ -47,77 +47,66 @@ 47 47 While the near-online HDF files were converted live during the beamtime, the offline HDF files were manually compiled weeks/months later. Near-online HDF files were used by a provided API [[BeamtimeDaqAccess>>url:https://confluence.desy.de/display/FLASHUSER/Near-Online+data+analysis#Near-Onlinedataanalysis-BriefUserGuidetoBeamtimeDAQAccess||shape="rect"]], which also searches for DAQ channels in files. With assembled files "by run", all DAQ channels were existing in the same HDF file. While shorter runs usually fitted into one file, longer runs still had to be aggregated over several files. The creation of the assembled type of files still involves the use of fairly unstable DAQ Mex-functions As the environment required for using the Mex-functions is already deprecated, its continued existence is uncertain. 48 48 49 49 The HDF hierarchy is almost identical in all formats. While in the the recent format each DAQ channel contains the data sets "value" and "index", the deprecated format had one data set for each DAQ channel. The deprecated formats supplies no time axis parameters for spectra data types. 43 +{{/expand}} 50 50 45 +== HDF excerpt: == 46 + 51 51 \\ 52 52 53 - **HDF5example files (oldformat)**49 +ADC data as example for **fast** **data** (10 Hz): 54 54 55 - Here we have a few HDF5 samples (User data combined with Photon diagnostics data) from a few beamtimes showing the different kind options.51 +\\ 56 56 57 -[[image:attach:image20 19-10-21_17-2-50.png||thumbnail="true" width="300"]] [[download HDF5 (Images @ FL2)>>url:https://desycloud.desy.de/index.php/s/nyEgeCWJFC4gao2||shape="rect"]]53 +[[image:attach:image2020-11-16_15-26-28.png||height="250"]] 58 58 59 59 \\ 60 60 61 -[[~[~[image:attach:image2019-10-22_10-52-27.png~|~|thumbnail="true" width="300"~]~]download HDF5 (GHz ADC and OPIS @ FL2)>>url:https://desycloud.desy.de/index.php/s/AeA2kPNNnZgX95A||shape="rect"]] 57 +{{info title="Sample scripts in python"}} 58 +== Reference implementation (Python) == 62 62 60 +[[~[~[image:attach:binder_badge.png~|~|thumbnail="true" width="120"~]~]>>url:https://mybinder.org/v2/git/https%3A%2F%2Fgitlab.desy.de%2Fchristopher.passow%2Fflash-daq-hdf/master||shape="rect"]] 61 + 62 +(% class="Object" %)[[https:~~/~~/gitlab.desy.de/christopher.passow/flash-daq-hdf>>url:https://gitlab.desy.de/christopher.passow/flash-daq-hdf||shape="rect"]] 63 +{{/info}} 64 + 63 63 \\ 64 64 65 65 \\ 66 -{{/expand}} 67 67 68 -=== HDF examples: === 69 -{{/layout-cell}} 70 -{{/layout-section}} 69 +== The discontinued (till 2021) FLASH HDF5 structure == 71 71 72 -{{layout-section ac:type="three_equal"}} 73 -{{layout-cell}} 74 -* ADC data as example for **fast** **data** (10 Hz): 71 +The photon diagnostic, electron diagnostic and beamline information as well as the information about the pump-probe laser and the infrastructure offered for users (GHz/MHz ADCs) can be included in one HDF5 file which is organized according to train IDs. The general structure is: 75 75 76 -\\ 73 +* Electron Diagnostic 74 +* Photon Diagnostics 75 +* Beamlines 76 +* Experiment 77 +* Timing 78 +[[image:attach:HDF5_structure.jpg||height="400"]] 77 77 78 - [[image:attach:image2020-11-16_15-26-28.png||height="250"]]79 - {{/layout-cell}}80 +A detailed description of (most) channels can be found in the lower part of the hdf5 viewer: 81 +[[image:attach:HDF5_structure_desc.jpg||thumbnail="true" height="250"]] 80 80 81 -{{layout-cell}} 82 -* The //average// FEL pulse energy as example for the **slow** **data** (different e.g. 1 Hz): 83 +[[Contents>>doc:||anchor="Contents"]] 83 83 84 - ( every 10th train ID is listed in the HDF group "index") 85 - 86 86 \\ 87 87 88 -[[image:attach:image2020-11-16_15-31-45.png||height="250"]] 89 -{{/layout-cell}} 87 +=== HDF5 example files === 90 90 91 -{{layout-cell}} 92 -* ((( 93 -//zraw// group contains the **original DAQ (DOOCS) names** 94 -))) 89 +Here we have a few HDF5 samples (User data combined with Photon diagnostics data) from a few beamtimes showing the different kind options. 95 95 96 - oftheropertiessavedintheDESYinternalrawformat.(Fors)91 +[[image:attach:image2019-10-21_17-2-50.png||thumbnail="true" width="300"]] [[download HDF5 (Images @ FL2)>>url:https://desycloud.desy.de/index.php/s/nyEgeCWJFC4gao2||shape="rect"]] 97 97 98 98 \\ 99 99 100 -[[image:attach:image2020-11-16_16-26-3.png||height="400"]] 101 -{{/layout-cell}} 102 -{{/layout-section}} 95 +[[~[~[image:attach:image2019-10-22_10-52-27.png~|~|thumbnail="true" width="300"~]~]download HDF5 (GHz ADC and OPIS @ FL2)>>url:https://desycloud.desy.de/index.php/s/AeA2kPNNnZgX95A||shape="rect"]] 103 103 104 -{{layout-section ac:type="single"}} 105 -{{layout-cell}} 106 -{{info}} 107 -=== Reference implementation (Python) === 97 +\\ 108 108 109 - (imperative) (% class="Object" %)[[https:~~/~~/gitlab.desy.de/christopher.passow/flash-daq-hdf>>url:https://gitlab.desy.de/christopher.passow/flash-daq-hdf||shape="rect"]]99 +\\ 110 110 111 -(% class="Object" %)(object oriented) [[https:~~/~~/gitlab.desy.de/christopher.passow/fdh-builder>>url:https://gitlab.desy.de/christopher.passow/fdh-builder.git||shape="rect"]] 112 -{{/info}} 113 -{{/layout-cell}} 114 -{{/layout-section}} 101 +[[Contents>>doc:||anchor="Contents"]] 115 115 116 -{{layout-section ac:type="single"}} 117 -{{layout-cell}} 118 -== Complete list of recordable parameters == 119 - 120 -The complete list for the relation between DOOCS names and HDF5 names for the recordable parameters can be found in [[DESY's Repository~[~[image:url:http://hasfweb.desy.de/pub/TWiki/TWikiDocGraphics/external-link.gif~|~|width="13" height="12"~]~]>>url:https://stash.desy.de/projects/CS/repos/pah/browse/src/camp/data/channel2HdfName.dat||shape="rect"]]. 121 121 \\ 122 122 123 123 == Most popular FLASH parameters and their names in HDF5, DOOCS and (raw) DAQ == ... ... @@ -124,6 +124,9 @@ 124 124 125 125 {{id name="DOOCSparameters"/}} 126 126 109 +The complete list for the relation between DOOCS names and HDF5 names for the recordable parameters can be found in [[DESY's Repository~[~[image:url:http://hasfweb.desy.de/pub/TWiki/TWikiDocGraphics/external-link.gif~|~|width="13" height="12"~]~]>>url:https://stash.desy.de/projects/CS/repos/pah/browse/src/camp/data/channel2HdfName.dat||shape="rect"]]. 110 +The most common and often used ones are summarized below: 111 + 127 127 Note, the HDF group and data set names apply to our HDF tree version since vers. 0.3.0. 128 128 129 129 \\ ... ... @@ -130,6 +130,8 @@ 130 130 131 131 === FLASH1 === 132 132 118 +\\ 119 + 133 133 ==== Beamline info (FLASH1) ==== 134 134 135 135 {{code language="none"}}/FL1/Beamlines/Attenuator/pressure{{/code}} ... ... @@ -208,7 +208,6 @@ 208 208 209 209 \\ 210 210 211 -{{expand title="GMD format untill 2021"}} 212 212 (% style="color: rgb(0,0,0);" %)**Discontinued GMD data recording / evaluation (VME + PhotonFlux ML server)** 213 213 214 214 {{code language="none"}}/FL1/Photon Diagnostic/GMD/Average energy/energy tunnel{{/code}} ... ... @@ -256,12 +256,9 @@ 256 256 DAQ channel: {{code language="none"}}PBD.PHFLUX/BDA.ENERGYPULSE.FF{{/code}} 257 257 desc :Energy per pulse BDA (from e-) - uncorrected values. There are also values saved if there was no beam ... just background noise 258 258 units : a.u. (more or less µJ but need to be calibrated with the "Average energy" for good precision)** [[see here for help>>doc:FLASHUSER.jddd-linked help pages.Calibrating the pulse resolved (electron) data from GMD.WebHome]]** 259 -{{/expand}} 260 260 261 261 \\ 262 262 263 -\\ 264 - 265 265 (% style="color: rgb(0,0,0);" %)**NEW (2021) GMD data recording / evaluation (MTCA, analog to FLASH2 and XFEL)** 266 266 267 267 {{code language="none"}}/FL1/Photon Diagnostic/GMD/Average energy/energy tunnel{{/code}} ... ... @@ -538,7 +538,7 @@ 538 538 //always saved (PBD)// 539 539 DOOCS prop : {{code language="none"}}TTF2.DIAG/PBD.TOROID.ML/12EXP/CHARGE.TD{{/code}} 540 540 DAQ channel: {{code language="none"}}TTF2.DIAG/PBD.TOROID.ML/12EXP{{/code}} 541 -desc: The bunch pattern as function of time in a burst recorded by toroid diagnostic BEHIND the undulator. (FLASH1) 524 +desc: The bunch pattern as function of time in a burst recorded by toroide diagnostic BEHIND the undulator. (FLASH1) 542 542 units: 543 543 \\ 544 544 ... ... @@ -611,7 +611,7 @@ 611 611 //always saved (PBD)// 612 612 DOOCS prop : {{code language="none"}}TTF2.FEL/TDOLFEL/TDOLFEL/STREAK.CAM.TIME{{/code}} 613 613 DOOCS prop : {{code language="none"}}TTF2.FEL/TDOLFEL/TDOLFEL/STREAK.CAM.TIME{{/code}} 614 -desc: delay 597 +desc: delaytime between the optical laser and the FEL units: ps 615 615 616 616 \\ 617 617 ... ... @@ -621,7 +621,7 @@ 621 621 622 622 ==== User Data (FLASH1) ==== 623 623 624 -The data saved specifically for detectors at an experiment will show up in /Experiment/ there is a large number of options for cameras or monitoring ofslow properties (motor positions etc) for user experiments. For details please ask your local contact.607 +The data saved specifically for detectors at an experiment will show up in /Experiment/ there is a large number of options for cameras or monitoring pslow properties (motor positons etc) for user experiments. For details please ask your local contact. 625 625 626 626 NOTE: If parameters for an experiment are included on short notice the correct naming in the HDF5 may not be in time and the data will show up in /uncategorized/ with the DOOCS names 627 627 ... ... @@ -649,13 +649,13 @@ 649 649 {{code language="none"}}/FL1/Experiment/BL3/ADQ412 GHz ADC/CH03/TD{{/code}} 650 650 651 651 DOOCS prop : {{code language="none"}}FLASH.FEL/ADC.ADQ.PG/EXP1.CH00/CH00.TD or CH00.DAQ.TD{{/code}} 652 -here the {{code language="none"}}CH00.TD{{/code}} is the full ADC trace as it is sampled ( typically several 100.000 samples per pulse train) while the {{code language="none"}}CH00.DAQ.TD{{/code}} trace only has the number of samples which are sent to the DAQ OR if //grouping// is activated the {{code language="none"}}CH00.DAQ.TD{{/code}} con tains only the grouped spectra. To read the ADC trace with an online analysis program the {{code language="none"}}CH00.DAQ.TD{{/code}} is used preferably.635 +here the {{code language="none"}}CH00.TD{{/code}} is the full ADC trace as it is sampled ( typically several 100.000 samples per pulse train) while the {{code language="none"}}CH00.DAQ.TD{{/code}} trace only has the number of samples which are sent to the DAQ OR if //grouping// is activated the {{code language="none"}}CH00.DAQ.TD{{/code}} conatins only the grouped spectra. To read the ADC trace with an online analysis program the {{code language="none"}}CH00.DAQ.TD{{/code}} is used preferablly. 653 653 DAQ channel: {{code language="none"}}FLASH.FEL/ADC.ADQ.PG/EXP1.CH00{{/code}} 654 654 655 655 In addition there are also additional parameters saved like: 656 656 657 657 * {{code language="none"}}sample frequency{{/code}}: it shows the sample frequency in MHz (number of samples per µs). NOTE: the clock of the ADC is NOT synchronized to the FLASH timing system. Thus the number of samples between bunches in the bunch train may be not integer numbers which will be show up for long bunch trains. 658 -* {{code language="none"}}number of samples{{/code}}: total number of sam ples recorded for each 10 Hz trigger641 +* {{code language="none"}}number of samples{{/code}}: total number of samoles recorded for each 10 Hz trigger 659 659 * {{code language="none"}}error (ADC):{{/code}} 0 indicates that there was no error 660 660 661 661 ===== MHz ADCs ===== ... ... @@ -668,7 +668,7 @@ 668 668 In addition there are also additional parameters saved like: 669 669 670 670 * {{code language="none"}}sample frequency{{/code}}: it shows the sample frequency in MHz (number of samples per µs). NOTE: the clock of the ADC is NOT synchronized to the FLASH timing system. Thus the number of samples between bunches in the bunch train may be not integer numbers which will be show up for long bunch trains. 671 -* {{code language="none"}}number of samples{{/code}}: total number of sam ples recorded for each 10 Hz trigger654 +* {{code language="none"}}number of samples{{/code}}: total number of samoles recorded for each 10 Hz trigger 672 672 673 673 [[Contents>>doc:||anchor="Contents"]] 674 674 ... ... @@ -829,7 +829,7 @@ 829 829 // saved opon request (PBD2)// 830 830 DOOCS prop : {{code language="none"}}FLASH.UTIL/STORE/FL2.TUNNEL.OPIS/VAL040{{/code}} 831 831 DAQ channel:{{code language="none"}} FLASH.UTIL/STORE/FL2.TUNNEL.OPIS/VAL040{{/code}} 832 -desc : mean 815 +desc : meanwavelength ( ~~ 1 sec averaging time ) measured in the TUNNEL for a specific bunch out of the bunch train (via photoelectron spectroscopy) 833 833 units : nm 834 834 835 835 \\ ... ... @@ -843,11 +843,11 @@ 843 843 844 844 \\ 845 845 846 -If Opis is running typically on the the averaged data is saved. For several experiments it may make sense to save the information for each single bunch. This is up to now done by sav ing the complete ADC trace of the TOF setup. This is a huge amount of data and needs processing. This has to be performed after the beamtime in close contact to [[Markus Braune>>mailto:markus.braune@desy.de||shape="rect"]] ( responsible for [[OPIS>>url:http://photon-science.desy.de/facilities/flash/photon_diagnostics/opis_spectrometer/index_eng.html||shape="rect"]])829 +If Opis is running typically on the the averaged data is saved. For several experiments it may make sense to save the information for each single bunch. This is up to now done by savng the compleate ADC trace of the TOF setup. This is a huge amount of data and needs processing. This has to be performed after the beamtime in close contact to [[Markus Braune>>mailto:markus.braune@desy.de||shape="rect"]] ( respobsible for [[OPIS>>url:http://photon-science.desy.de/facilities/flash/photon_diagnostics/opis_spectrometer/index_eng.html||shape="rect"]]) 847 847 848 848 \\ 849 849 850 -In case OPIS was not operating there is still informat ion about the **set wavelength** for the undulators (see below) which may differ by up to 5 % from the actual wavelength due to different settings in the FEL ...833 +In case OPIS was not operating there is still informaton about the **set wavelength** for the undulators (see below) which may differ by up to 5 % from the actual wavelength due to different settings in the FEL ... 851 851 852 852 \\ 853 853 ... ... @@ -866,21 +866,8 @@ 866 866 867 867 \\ 868 868 869 -===== electronbunchenergy=====852 +===== undulator settings ===== 870 870 871 -{{code language="none"}}/FL2/Electron Diagnostic/Electron energy/energy of first bunch/behind undulators{{/code}} 872 -//always saved (PBD2)// 873 -DOOCS prop : {{code language="none"}}FLASH.DIAG/BEAM_ENERGY_MEASUREMENT/FL2XTDS/ENERGY.FLASH2{{/code}} 874 -DAQ channel: (% style="color: rgb(94,108,132);" %)TTF2.DAQ/PBD2.BEAM.ENERGY.MEAS.ML.COPY/FL2XTDS.ENERGY.FLASH2{{code language="none"}}{{/code}}(%%) 875 -desc: electron bunch energy measured behind the undulator. Data is saved with 10 Hz - BUT (for computation reasons) only the energy of the FIRST bunch is recorded. The data is also available for (% style="color: rgb(94,108,132);" %)extraction and septum in the beginning of FLASH2 876 - 877 -(% style="letter-spacing: 0.0px;" %)units: (% class="twikiNewLink" %)MeV 878 - 879 -\\ 880 - 881 -(% style="color: rgb(94,108,132);font-weight: 600;letter-spacing: 0.0px;" %) 882 -undulator settings 883 - 884 884 {{code language="none"}}/FL2/Electron Diagnostic/Undulator setting/set wavelength{{/code}} 885 885 //always saved (PBD2)// 886 886 DOOCS prop : {{code language="none"}}TTF2.FEEDBACK/FL2.WAVELENGTHCONTROL/FLASH2/WAVELENGTH{{/code}} ... ... @@ -1210,6 +1210,13 @@ 1210 1210 [[Contents>>doc:||anchor="Contents"]] 1211 1211 1212 1212 \\ 1213 -{{/layout-cell}} 1214 -{{/layout-section}} 1215 -{{/layout}} 1183 + 1184 +== Example code showing how to access HDF5 files == 1185 + 1186 +\\ 1187 + 1188 +== HDF5 and DOOCS == 1189 + 1190 +Here is an outdated [[list with the available properties that are always saved (PBD) for FLASH1 as>>attach:FLASH1__DaqChannel2HdfNamePbd.xlsx]] HDF5 names and the corresponding DOOCS names 1191 + 1192 +[[Contents>>doc:||anchor="Contents"]]