Changes for page The FLASH HDF5 structure
Last modified by sndueste on 2025/02/06 10:55
Summary
-
Page properties (1 modified, 0 added, 0 removed)
Details
- Page properties
-
- Content
-
... ... @@ -1,3 +1,6 @@ 1 +{{layout}} 2 +{{layout-section ac:type="single"}} 3 +{{layout-cell}} 1 1 == Contents == 2 2 3 3 ... ... @@ -16,14 +16,18 @@ 16 16 {{code language="none"}}> module load xray{{/code}} 17 17 {{code language="none"}}> hdfview{{/code}} 18 18 19 - [[Contents>>doc:||anchor="Contents"]] 22 +or you can use 20 20 21 - == The new (starting 2021) HDF5 format == 24 +{{code language="none"}}> silx view{{/code}} 22 22 23 23 \\ 24 24 25 - FLASH provides a conversion of its data acquisition (DAQ) to the commonly used [[HDF5>>url:https://www.hdfgroup.org/solutions/hdf5/||shape="rect"]] format. Correlated data are mapped by a primary index called **train ID**. Every data set has an individual index of train IDs to identify the data even when data are missing or deviate in update rate. If the data set of choice contains gaps, users have to decide how to treat missing values. DAQ channels are spread across various files with one file name pattern for each DAQ. This means users have to assemble data from different files if necessary. 28 +[[Contents>>doc:||anchor="Contents"]] 26 26 30 +== The FLASH HDF5 format == 31 + 32 +FLASH provides a conversion of its data acquisition (DAQ) to the commonly used [[HDF5>>url:https://www.hdfgroup.org/solutions/hdf5/||shape="rect"]] format. Correlated data are mapped by a primary index called **train ID**. Every data set has an individual index of train IDs to identify the data even when data are missing or deviate in update rate. If the data set of choice contains gaps, users have to decide how to treat missing values. DAQ channels are spread across various files with one file name pattern for each DAQ. This means users have to assemble data from different files if necessary. 33 + 27 27 The hierarchy is realized using a human readable named HDF tree with each DAQ channel containing the data sets "//value//" and "//index//". Additionally, the HDF group //zraw// contains a tree of the original DAQ channel names. 
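The "value"/"index" layout described above suggests a simple way to combine channels: read both data sets of each channel and join them on the train ID. The following is a minimal sketch under stated assumptions, not the reference implementation. The helper names `read_channel` and `align` and the two channel paths are hypothetical, and a plain dict stands in for the HDF5 file so that the example runs without one. It assumes that "index" holds exactly one train ID per entry of "value"; an open `h5py.File` is expected to work in place of the dict, since it supports the same `source[channel]["index"]` access.

```python
# Sketch: align two DAQ channels on their train IDs.
# Assumption: each channel group holds the data sets "index" (train IDs)
# and "value" (one entry per train ID).


def read_channel(source, channel):
    """Return {train_id: value} for one channel.

    `source` is anything that supports source[channel]["index"] and
    source[channel]["value"], e.g. the stand-in dict used below.
    """
    group = source[channel]
    train_ids = [int(t) for t in group["index"]]
    values = list(group["value"])
    return dict(zip(train_ids, values))


def align(fast, slow, fill=None):
    """Join two {train_id: value} maps on the train IDs of `fast`.

    Train IDs that are missing in `slow` receive `fill`.
    """
    return {tid: (val, slow.get(tid, fill)) for tid, val in fast.items()}


# Stand-in for an HDF5 file; both channel paths are hypothetical.
demo = {
    "/FL1/Experiment/fast channel": {
        "index": [1000, 1001, 1002, 1003],
        "value": [0.1, 0.2, 0.3, 0.4],
    },
    "/FL1/Experiment/slow channel": {
        "index": [1000, 1002],
        "value": [50.0, 52.0],
    },
}

fast = read_channel(demo, "/FL1/Experiment/fast channel")
slow = read_channel(demo, "/FL1/Experiment/slow channel")
aligned = align(fast, slow)

for train_id, (fast_value, slow_value) in aligned.items():
    print(train_id, fast_value, slow_value)
```

Missing train IDs are filled with `None` here; whether such gaps should be dropped, forward-filled, or interpolated depends on the analysis and is left to the user.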
28 28 29 29 A reference implementation, which follows the concept of Python libraries like Pandas, Xarray, or Dask, is given below. ... ... @@ -31,7 +31,7 @@ 31 31 \\ 32 32 33 33 {{expand title="Discontinued HDF formats"}} 34 -== Comparison to FLASH's deprecated HDF formats == 41 +=== Comparison to FLASH's deprecated HDF formats === 35 35 36 36 Before 2021, FLASH provided two different HDF formats formally known as //near-online// and //offline// HDF files. 37 37 ... ... @@ -40,71 +40,77 @@ 40 40 While the near-online HDF files were converted live during the beamtime, the offline HDF files were manually compiled weeks/months later. Near-online HDF files were used by a provided API [[BeamtimeDaqAccess>>url:https://confluence.desy.de/display/FLASHUSER/Near-Online+data+analysis#Near-Onlinedataanalysis-BriefUserGuidetoBeamtimeDAQAccess||shape="rect"]], which also searches for DAQ channels in files. With assembled files "by run", all DAQ channels existed in the same HDF file. While shorter runs usually fitted into one file, longer runs still had to be aggregated over several files. The creation of the assembled type of files still involves the use of fairly unstable DAQ Mex-functions. As the environment required for using the Mex-functions is already deprecated, its continued existence is uncertain. 41 41 42 42 The HDF hierarchy is almost identical in all formats. While in the recent format each DAQ channel contains the data sets "value" and "index", the deprecated format had one data set for each DAQ channel. The deprecated formats supply no time axis parameters for spectra data types. 43 -{{/expand}} 44 44 45 - === HDF excerpt: === 51 +\\ 46 46 47 -* ADC data as example for **fast** **data** (10 Hz): [[image:attach:image2020-11-16_15-26-28.png||height="250"]] 48 -* The //average// FEL pulse energy as example for the **slow** **data** (different e.g. 
1 Hz):[[image:attach:image2020-11-16_15-31-45.png||height="250"]] 53 +**HDF5 example files (old format)** 49 49 50 - ( every 10th train ID is listed in the HDF group "index") 55 +Here we have a few HDF5 samples (User data combined with Photon diagnostics data) from a few beamtimes showing the different kind options. 51 51 52 -* ((( 53 -//zraw// group contains the **original DAQ (DOOCS) names[[image:attach:image2020-11-16_16-26-3.png||thumbnail="true" height="250"]]** 54 -))) 57 +[[image:attach:image2019-10-21_17-2-50.png||thumbnail="true" width="300"]] [[download HDF5 (Images @ FL2)>>url:https://desycloud.desy.de/index.php/s/nyEgeCWJFC4gao2||shape="rect"]] 55 55 56 - of the properties saved in the DESY internal raw format. (For experts) 57 - 58 58 \\ 59 59 60 - \\ 61 +[[~[~[image:attach:image2019-10-22_10-52-27.png~|~|thumbnail="true" width="300"~]~]download HDF5 (GHz ADC and OPIS @ FL2)>>url:https://desycloud.desy.de/index.php/s/AeA2kPNNnZgX95A||shape="rect"]] 61 61 62 -{{info title="Sample scripts in python"}} 63 -=== Reference implementation (Python) === 64 - 65 -[[~[~[image:attach:binder_badge.png~|~|thumbnail="true" width="120"~]~]>>url:https://mybinder.org/v2/git/https%3A%2F%2Fgitlab.desy.de%2Fchristopher.passow%2Fflash-daq-hdf/master||shape="rect"]] 66 - 67 -(% class="Object" %)[[https:~~/~~/gitlab.desy.de/christopher.passow/flash-daq-hdf>>url:https://gitlab.desy.de/christopher.passow/flash-daq-hdf||shape="rect"]] 68 -{{/info}} 69 - 70 70 \\ 71 71 72 72 \\ 66 +{{/expand}} 73 73 74 -== The discontinued (till 2021) FLASH HDF5 structure == 68 +=== HDF examples: === 69 +{{/layout-cell}} 70 +{{/layout-section}} 75 75 76 -The photon diagnostic, electron diagnostic and beamline information as well as the information about the pump-probe laser and the infrastructure offered for users (GHz/MHz ADCs) can be included in one HDF5 file which is organized according to train IDs. 
The general structure is: 72 +{{layout-section ac:type="three_equal"}} 73 +{{layout-cell}} 74 +* ADC data as example for **fast** **data** (10 Hz): 77 77 78 -* Electron Diagnostic 79 -* Photon Diagnostics 80 -* Beamlines 81 -* Experiment 82 -* Timing 83 -[[image:attach:HDF5_structure.jpg||height="400"]] 76 +\\ 84 84 85 - A detailed description of (most) channels can be found in the lower part of the hdf5 viewer: 86 - [[image:attach:HDF5_structure_desc.jpg||thumbnail="true" height="250"]] 78 +[[image:attach:image2020-11-16_15-26-28.png||height="250"]] 79 +{{/layout-cell}} 87 87 88 -[[Contents>>doc:||anchor="Contents"]] 81 +{{layout-cell}} 82 +* The //average// FEL pulse energy as example for the **slow** **data** (different e.g. 1 Hz): 89 89 84 + ( every 10th train ID is listed in the HDF group "index") 85 + 90 90 \\ 91 91 92 -=== HDF5 example files (old format) === 88 +[[image:attach:image2020-11-16_15-31-45.png||height="250"]] 89 +{{/layout-cell}} 93 93 94 -Here we have a few HDF5 samples (User data combined with Photon diagnostics data) from a few beamtimes showing the different kind options. 91 +{{layout-cell}} 92 +* ((( 93 +//zraw// group contains the **original DAQ (DOOCS) names** 94 +))) 95 95 96 - [[image:attach:image2019-10-21_17-2-50.png||thumbnail="true" width="300"]] [[download HDF5 (Images @ FL2)>>url:https://desycloud.desy.de/index.php/s/nyEgeCWJFC4gao2||shape="rect"]] 96 + of the properties saved in the DESY internal raw format. 
(For experts) 97 97 98 98 \\ 99 99 100 -[[~[~[image:attach:image2019-10-22_10-52-27.png~|~|thumbnail="true" width="300"~]~]download HDF5 (GHz ADC and OPIS @ FL2)>>url:https://desycloud.desy.de/index.php/s/AeA2kPNNnZgX95A||shape="rect"]] 100 +[[image:attach:image2020-11-16_16-26-3.png||height="400"]] 101 +{{/layout-cell}} 102 +{{/layout-section}} 101 101 102 -\\ 104 +{{layout-section ac:type="single"}} 105 +{{layout-cell}} 106 +{{info}} 107 +=== Reference implementation (Python) === 103 103 104 - \\109 +(imperative) (% class="Object" %)[[https:~~/~~/gitlab.desy.de/christopher.passow/flash-daq-hdf>>url:https://gitlab.desy.de/christopher.passow/flash-daq-hdf||shape="rect"]] 105 105 106 -[[Contents>>doc:||anchor="Contents"]] 111 +(% class="Object" %)(object oriented) [[https:~~/~~/gitlab.desy.de/christopher.passow/fdh-builder>>url:https://gitlab.desy.de/christopher.passow/fdh-builder.git||shape="rect"]] 112 +{{/info}} 113 +{{/layout-cell}} 114 +{{/layout-section}} 107 107 116 +{{layout-section ac:type="single"}} 117 +{{layout-cell}} 118 +== Complete list of recordable parameters == 119 + 120 +The complete list for the relation between DOOCS names and HDF5 names for the recordable parameters can be found in [[DESY's Repository~[~[image:url:http://hasfweb.desy.de/pub/TWiki/TWikiDocGraphics/external-link.gif~|~|width="13" height="12"~]~]>>url:https://stash.desy.de/projects/CS/repos/pah/browse/src/camp/data/channel2HdfName.dat||shape="rect"]]. 108 108 \\ 109 109 110 110 == Most popular FLASH parameters and their names in HDF5, DOOCS and (raw) DAQ == ... ... 
@@ -111,9 +111,6 @@ 111 111 112 112 {{id name="DOOCSparameters"/}} 113 113 114 -The complete list for the relation between DOOCS names and HDF5 names for the recordable parameters can be found in [[DESY's Repository~[~[image:url:http://hasfweb.desy.de/pub/TWiki/TWikiDocGraphics/external-link.gif~|~|width="13" height="12"~]~]>>url:https://stash.desy.de/projects/CS/repos/pah/browse/src/camp/data/channel2HdfName.dat||shape="rect"]]. 115 -The most common and often used ones are summarized below: 116 - 117 117 Note, the HDF group and data set names apply to our HDF tree version since vers. 0.3.0. 118 118 119 119 \\ ... ... @@ -120,8 +120,6 @@ 120 120 121 121 === FLASH1 === 122 122 123 -\\ 124 - 125 125 ==== Beamline info (FLASH1) ==== 126 126 127 127 {{code language="none"}}/FL1/Beamlines/Attenuator/pressure{{/code}} ... ... @@ -198,8 +198,7 @@ 198 198 199 199 ==== Photon Diagnostics SASE ([[GMD>>url:http://photon-science.desy.de/facilities/flash/photon_diagnostics/gmd_intensity_and_position/index_eng.html||shape="rect"]]) ==== 200 200 201 -\\ 202 - 209 +{{expand title="Discontinued GMD format (used until 2021)"}} 203 203 (% style="color: rgb(0,0,0);" %)**Discontinued GMD data recording / evaluation (VME + PhotonFlux ML server)** 204 204 205 205 {{code language="none"}}/FL1/Photon Diagnostic/GMD/Average energy/energy tunnel{{/code}} ... ... @@ -247,10 +247,11 @@ 247 247 DAQ channel: {{code language="none"}}PBD.PHFLUX/BDA.ENERGYPULSE.FF{{/code}} 248 248 desc :Energy per pulse BDA (from e-) - uncorrected values. There are also values saved if there was no beam ... just background noise 249 249 units : a.u. 
(more or less µJ but need to be calibrated with the "Average energy" for good precision)** [[see here for help>>doc:FLASHUSER.jddd-linked help pages.Calibrating the pulse resolved (electron) data from GMD.WebHome]]** 257 +{{/expand}} 250 250 251 251 \\ 252 252 253 -(% style="color: rgb(0,0,0) ;" %)**NEW (2021) GMD data recording / evaluation (MTCA, analog to FLASH2 and XFEL)** 261 +(% style="letter-spacing: 0px; color: rgb(0, 0, 0)" %)**NEW (since 2021) GMD data recording / evaluation (same format as FLASH2 and XFEL)** 254 254 255 255 {{code language="none"}}/FL1/Photon Diagnostic/GMD/Average energy/energy tunnel{{/code}} 256 256 //always saved (PBD)// ... ... @@ -427,7 +427,7 @@ 427 427 {{info title="BAM hints"}} 428 428 * Besides the arrival time from FLASH1, the FLASH2/3 electron arrival time is also saved. In case of doubt, ask your local contact. 429 429 * [[Link to a collection of papers related to the BAM and the analysis of pump-probe experiments >>doc:FLASHUSER.Additional helpful things.FLASH beamlines and instruments references.WebHome]] 430 -* [[LINK to detailed infos from MSK (may only work inside DESY network~[~[image:url:http://hasfweb.desy.de/pub/TWiki/TWikiDocGraphics/external-link.gif~|~|width="13" height="12"~]~]>>url:http://www.desy.de/~~mbock/pages/BAM_daq_channel_descriptions.html||shape="rect"]] 438 +* [[LINK to detailed infos from MSK>>url:https://confluence.desy.de/display/SDiagPublic/BAM+Data+Structure||shape="rect"]] 431 431 * a recent [[talk about the working principle of the BAM>>attach:BAM-basics and outlook-2018_DESY-template_16-9Format.pdf]] 432 432 {{/info}} 433 433 ... ... @@ -526,7 +526,7 @@ 526 526 //always saved (PBD)// 527 527 DOOCS prop : {{code language="none"}}TTF2.DIAG/PBD.TOROID.ML/12EXP/CHARGE.TD{{/code}} 528 528 DAQ channel: {{code language="none"}}TTF2.DIAG/PBD.TOROID.ML/12EXP{{/code}} 529 -desc: The bunch pattern as function of time in a burst recorded by toroid ediagnostic BEHIND the undulator. 
(FLASH1)537 +desc: The bunch pattern as function of time in a burst recorded by toroid diagnostic BEHIND the undulator. (FLASH1) 530 530 units: 531 531 \\ 532 532 ... ... @@ -599,7 +599,7 @@ 599 599 //always saved (PBD)// 600 600 DOOCS prop : {{code language="none"}}TTF2.FEL/TDOLFEL/TDOLFEL/STREAK.CAM.TIME{{/code}} 601 601 DOOCS prop : {{code language="none"}}TTF2.FEL/TDOLFEL/TDOLFEL/STREAK.CAM.TIME{{/code}} 602 -desc: delaytime between the optical laser and the FEL units: ps 610 +desc: delay time between the optical laser and the FEL units: ps 603 603 604 604 \\ 605 605 ... ... @@ -609,7 +609,7 @@ 609 609 610 610 ==== User Data (FLASH1) ==== 611 611 612 -The data saved specifically for detectors at an experiment will show up in /Experiment/ there is a large number of options for cameras or monitoring pslow properties (motor positons etc) for user experiments. For details please ask your local contact.620 +The data saved specifically for detectors at an experiment will show up in /Experiment/ there is a large number of options for cameras or monitoring of slow properties (motor positions etc) for user experiments. For details please ask your local contact. 613 613 614 614 NOTE: If parameters for an experiment are included on short notice the correct naming in the HDF5 may not be in time and the data will show up in /uncategorized/ with the DOOCS names 615 615 ... ... @@ -637,13 +637,13 @@ 637 637 {{code language="none"}}/FL1/Experiment/BL3/ADQ412 GHz ADC/CH03/TD{{/code}} 638 638 639 639 DOOCS prop : {{code language="none"}}FLASH.FEL/ADC.ADQ.PG/EXP1.CH00/CH00.TD or CH00.DAQ.TD{{/code}} 640 -here the {{code language="none"}}CH00.TD{{/code}} is the full ADC trace as it is sampled ( typically several 100.000 samples per pulse train) while the {{code language="none"}}CH00.DAQ.TD{{/code}} trace only has the number of samples which are sent to the DAQ OR if //grouping// is activated the {{code language="none"}}CH00.DAQ.TD{{/code}} con atins only the grouped spectra. 
To read the ADC trace with an online analysis program the {{code language="none"}}CH00.DAQ.TD{{/code}} is used preferablly. 648 +here the {{code language="none"}}CH00.TD{{/code}} is the full ADC trace as it is sampled ( typically several 100.000 samples per pulse train) while the {{code language="none"}}CH00.DAQ.TD{{/code}} trace only has the number of samples which are sent to the DAQ OR if //grouping// is activated the {{code language="none"}}CH00.DAQ.TD{{/code}} contains only the grouped spectra. To read the ADC trace with an online analysis program the {{code language="none"}}CH00.DAQ.TD{{/code}} is used preferably. 641 641 DAQ channel: {{code language="none"}}FLASH.FEL/ADC.ADQ.PG/EXP1.CH00{{/code}} 642 642 643 643 Additional parameters are also saved, such as: 644 644 645 645 * {{code language="none"}}sample frequency{{/code}}: the sample frequency in MHz (number of samples per µs). NOTE: the clock of the ADC is NOT synchronized to the FLASH timing system. Thus the number of samples between bunches in the bunch train may not be an integer, which shows up for long bunch trains. 646 -* {{code language="none"}}number of samples{{/code}}: total number of sam oles recorded for each 10 Hz trigger 654 +* {{code language="none"}}number of samples{{/code}}: total number of samples recorded for each 10 Hz trigger 647 647 * {{code language="none"}}error (ADC):{{/code}} 0 indicates that there was no error 648 648 649 649 ===== MHz ADCs ===== ... ... @@ -656,7 +656,7 @@ 656 656 Additional parameters are also saved, such as: 657 657 658 658 * {{code language="none"}}sample frequency{{/code}}: the sample frequency in MHz (number of samples per µs). NOTE: the clock of the ADC is NOT synchronized to the FLASH timing system. Thus the number of samples between bunches in the bunch train may not be an integer, which shows up for long bunch trains. 
659 -* {{code language="none"}}number of samples{{/code}}: total number of sam oles recorded for each 10 Hz trigger 667 +* {{code language="none"}}number of samples{{/code}}: total number of samples recorded for each 10 Hz trigger 660 660 661 661 [[Contents>>doc:||anchor="Contents"]] 662 662 ... ... @@ -817,7 +817,7 @@ 817 817 // saved upon request (PBD2)// 818 818 DOOCS prop : {{code language="none"}}FLASH.UTIL/STORE/FL2.TUNNEL.OPIS/VAL040{{/code}} 819 819 DAQ channel:{{code language="none"}} FLASH.UTIL/STORE/FL2.TUNNEL.OPIS/VAL040{{/code}} 820 -desc : meanwavelength ( ~~ 1 sec averaging time ) measured in the TUNNEL for a specific bunch out of the bunch train (via photoelectron spectroscopy) 828 +desc : mean wavelength ( ~~ 1 sec averaging time ) measured in the TUNNEL for a specific bunch out of the bunch train (via photoelectron spectroscopy) 821 821 units : nm 822 822 823 823 \\ ... ... @@ -831,11 +831,11 @@ 831 831 832 832 \\ 833 833 834 -If Opis is running typically only the averaged data is saved. For several experiments it may make sense to save the information for each single bunch. This is up to now done by savng the comple ate ADC trace of the TOF setup. This is a huge amount of data and needs processing. This has to be performed after the beamtime in close contact to [[Markus Braune>>mailto:markus.braune@desy.de||shape="rect"]] ( respobsible for [[OPIS>>url:http://photon-science.desy.de/facilities/flash/photon_diagnostics/opis_spectrometer/index_eng.html||shape="rect"]]) 842 +If Opis is running typically only the averaged data is saved. For several experiments it may make sense to save the information for each single bunch. This is up to now done by saving the complete ADC trace of the TOF setup. This is a huge amount of data and needs processing. 
This has to be performed after the beamtime in close contact to [[Markus Braune>>mailto:markus.braune@desy.de||shape="rect"]] ( responsible for [[OPIS>>url:http://photon-science.desy.de/facilities/flash/photon_diagnostics/opis_spectrometer/index_eng.html||shape="rect"]]) 835 835 836 836 \\ 837 837 838 -In case OPIS was not operating there is still informaton about the **set wavelength** for the undulators (see below) which may differ by up to 5 % from the actual wavelength due to different settings in the FEL ... 846 +In case OPIS was not operating there is still information about the **set wavelength** for the undulators (see below) which may differ by up to 5 % from the actual wavelength due to different settings in the FEL ... 839 839 840 840 \\ 841 841 ... ... @@ -854,8 +854,21 @@ 854 854 855 855 \\ 856 856 857 -===== undulatorsettings=====865 +===== electron bunch energy ===== 858 858 867 +{{code language="none"}}/FL2/Electron Diagnostic/Electron energy/energy of first bunch/behind undulators{{/code}} 868 +//always saved (PBD2)// 869 +DOOCS prop : {{code language="none"}}FLASH.DIAG/BEAM_ENERGY_MEASUREMENT/FL2XTDS/ENERGY.FLASH2{{/code}} 870 +DAQ channel: (% style="color: rgb(94,108,132);" %)TTF2.DAQ/PBD2.BEAM.ENERGY.MEAS.ML.COPY/FL2XTDS.ENERGY.FLASH2{{code language="none"}}{{/code}}(%%) 871 +desc: electron bunch energy measured behind the undulator. Data is saved with 10 Hz - BUT (for computation reasons) only the energy of the FIRST bunch is recorded. 
The data is also available for (% style="color: rgb(94,108,132);" %)extraction and septum in the beginning of FLASH2 872 + 873 +(% style="letter-spacing: 0.0px;" %)units: (% class="twikiNewLink" %)MeV 874 + 875 +\\ 876 + 877 +(% style="color: rgb(94,108,132);font-weight: 600;letter-spacing: 0.0px;" %) 878 +undulator settings 879 + 859 859 {{code language="none"}}/FL2/Electron Diagnostic/Undulator setting/set wavelength{{/code}} 860 860 //always saved (PBD2)// 861 861 DOOCS prop : {{code language="none"}}TTF2.FEEDBACK/FL2.WAVELENGTHCONTROL/FLASH2/WAVELENGTH{{/code}} ... ... @@ -1185,15 +1185,6 @@ 1185 1185 [[Contents>>doc:||anchor="Contents"]] 1186 1186 1187 1187 \\ 1188 - 1189 -== HDF5 and DOOCS == 1190 - 1191 -Here is an outdated [[list with the available properties that are always saved (PBD) for FLASH1 as>>attach:FLASH1__DaqChannel2HdfNamePbd.xlsx]] HDF5 names and the corresponding DOOCS names 1192 - 1193 -[[Contents>>doc:||anchor="Contents"]] 1194 - 1195 -\\ 1196 - 1197 -\\ 1198 - 1199 -\\ 1209 +{{/layout-cell}} 1210 +{{/layout-section}} 1211 +{{/layout}}