Changes for page The FLASH HDF5 structure

Last modified by sndueste on 2025/02/06 10:55

From version 44.1
edited by cpassow
on 2021/02/10 09:30
Change comment: There is no comment for this version
To version 56.1
edited by sndueste
on 2022/09/07 09:38
Change comment: There is no comment for this version

Summary

Details

Page properties
Author
... ... @@ -1,1 +1,1 @@
1 -XWiki.cpassow
1 +XWiki.sndueste
Content
... ... @@ -19,14 +19,18 @@
19 19  {{code language="none"}}> module load xray{{/code}}
20 20  {{code language="none"}}> hdfview{{/code}}
21 21  
22 -[[Contents>>doc:||anchor="Contents"]]
22 +or you can use
23 23  
24 -== The new (starting 2021) HDF5 format ==
24 +{{code language="none"}}> silx view{{/code}}
25 25  
26 26  \\
27 27  
28 -FLASH provides a conversion of its data acquisition (DAQ) to the commonly used [[HDF5>>url:https://www.hdfgroup.org/solutions/hdf5/||shape="rect"]] format. Correlated  data are mapped by a primary index called **train ID**. Every data set has an individual index of train IDs to identify the data even when data are missing or deviate in update rate.If the data set of choice contains gaps,  users have to decide how to treat missing values. DAQ channels are spread across various files with one file name pattern for each DAQ. This means users have to assemble data from different files if necessary.
28 +[[Contents>>doc:||anchor="Contents"]]
29 29  
30 +== The FLASH HDF5 format ==
31 +
32 +FLASH provides a conversion of its data acquisition (DAQ) to the commonly used [[HDF5>>url:https://www.hdfgroup.org/solutions/hdf5/||shape="rect"]] format. Correlated  data are mapped by a primary index called **train ID**. Every data set has an individual index of train IDs to identify the data even when data are missing or deviate in update rate. If the data set of choice contains gaps,  users have to decide how to treat missing values. DAQ channels are spread across various files with one file name pattern for each DAQ. This means users have to assemble data from different files if necessary.
33 +
30 30  The hierarchy is realized using a human readable named HDF tree with each DAQ channel containing the data sets "//value//" and "//index//". Additionally, the HDF group //zraw// contains a tree of the original DAQ channel names.
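The role of the per-channel train ID index can be sketched in a few lines of plain Python. This is only an illustration, not the reference implementation: the function name `align_by_train_id`, the channel names and all numbers are made up, and each channel is represented as a pair of sequences standing in for its "index" and "value" data sets. Gaps are filled with `None`, which is one possible way to treat missing values.

```python
from typing import Dict, List, Optional, Sequence, Tuple

# One channel = (train IDs from "index", data from "value"); hypothetical layout.
Channel = Tuple[Sequence[int], Sequence[float]]


def align_by_train_id(
    channels: Dict[str, Channel],
) -> Tuple[List[int], Dict[str, List[Optional[float]]]]:
    """Outer-join channels on train ID; missing entries become None."""
    # Build a train ID -> value lookup for every channel.
    lookups = {
        name: dict(zip(index, value)) for name, (index, value) in channels.items()
    }
    # Union of all train IDs seen in any channel, in ascending order.
    all_ids = sorted(set().union(*(lut.keys() for lut in lookups.values())))
    # One column per channel, aligned to the common train ID axis.
    columns = {
        name: [lut.get(train_id) for train_id in all_ids]
        for name, lut in lookups.items()
    }
    return all_ids, columns


# Made-up example: the second channel has no entry for train 1001.
example = {
    "energy": ([1000, 1001, 1002, 1003], [10.5, 11.0, 9.8, 10.1]),
    "delay": ([1000, 1002, 1003], [0.1, 0.3, 0.4]),
}
train_ids, table = align_by_train_id(example)
```

In practice the two sequences of a channel would be read from its "index" and "value" data sets, for example with h5py, possibly from different files. Dropping every train ID for which any column holds `None` turns the outer join into an inner join.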
31 31  
32 32  Reference implementation, which follows the concept of Python libraries like Pandas, Xarray, or Dask, is given below.
... ... @@ -34,7 +34,7 @@
34 34  \\
35 35  
36 36  {{expand title="Discontinued HDF formats"}}
37 -== Comparison to FLASH's deprecated HDF formats ==
41 +=== Comparison to FLASH's deprecated HDF formats ===
38 38  
39 39  Before 2021, FLASH provided two different HDF formats formally known as //near-online// and //offline// HDF files.
40 40  
... ... @@ -43,9 +43,25 @@
43 43  While the near-online HDF files were converted live during the beamtime, the offline HDF files were manually compiled weeks/months later. Near-online HDF files were accessed through the provided API [[BeamtimeDaqAccess>>url:https://confluence.desy.de/display/FLASHUSER/Near-Online+data+analysis#Near-Onlinedataanalysis-BriefUserGuidetoBeamtimeDAQAccess||shape="rect"]], which also searches for DAQ channels in files. With files assembled "by run", all DAQ channels were contained in the same HDF file. While shorter runs usually fitted into one file, longer runs still had to be aggregated over several files. The creation of the assembled type of files still involves the use of fairly unstable DAQ Mex-functions. As the environment required for using the Mex-functions is already deprecated, its continued existence is uncertain.
44 44  
45 45  The HDF hierarchy is almost identical in all formats. While in the recent format each DAQ channel contains the data sets "value" and "index", the deprecated formats had one data set for each DAQ channel. The deprecated formats supply no time axis parameters for spectra data types.
50 +
51 +\\
52 +
53 +**HDF5 example files (old format)**
54 +
55 +Here we have a few HDF5 samples (user data combined with photon diagnostics data) from a few beamtimes showing the different kinds of options.
56 +
57 +[[image:attach:image2019-10-21_17-2-50.png||thumbnail="true" width="300"]] [[download HDF5 (Images @ FL2)>>url:https://desycloud.desy.de/index.php/s/nyEgeCWJFC4gao2||shape="rect"]]
58 +
59 +\\
60 +
61 +[[~[~[image:attach:image2019-10-22_10-52-27.png~|~|thumbnail="true" width="300"~]~]download HDF5 (GHz ADC and OPIS @ FL2)>>url:https://desycloud.desy.de/index.php/s/AeA2kPNNnZgX95A||shape="rect"]]
62 +
63 +\\
64 +
65 +\\
46 46  {{/expand}}
47 47  
48 -=== HDF excerpt: ===
68 +=== HDF examples: ===
49 49  {{/layout-cell}}
50 50  {{/layout-section}}
51 51  
... ... @@ -83,12 +83,12 @@
83 83  
84 84  {{layout-section ac:type="single"}}
85 85  {{layout-cell}}
86 -{{info title="Sample scripts in python"}}
106 +{{info}}
87 87  === Reference implementation (Python) ===
88 88  
89 -[[~[~[image:attach:binder_badge.png~|~|thumbnail="true" width="120"~]~]>>url:https://mybinder.org/v2/git/https%3A%2F%2Fgitlab.desy.de%2Fchristopher.passow%2Fflash-daq-hdf/master||shape="rect"]]
109 +(imperative)  (% class="Object" %)[[https:~~/~~/gitlab.desy.de/christopher.passow/flash-daq-hdf>>url:https://gitlab.desy.de/christopher.passow/flash-daq-hdf||shape="rect"]]
90 90  
91 -(% class="Object" %)[[https:~~/~~/gitlab.desy.de/christopher.passow/flash-daq-hdf>>url:https://gitlab.desy.de/christopher.passow/flash-daq-hdf||shape="rect"]]
111 +(% class="Object" %)(object oriented) [[https:~~/~~/gitlab.desy.de/christopher.passow/fdh-builder>>url:https://gitlab.desy.de/christopher.passow/fdh-builder.git||shape="rect"]]
92 92  {{/info}}
93 93  {{/layout-cell}}
94 94  {{/layout-section}}
... ... @@ -95,49 +95,15 @@
95 95  
96 96  {{layout-section ac:type="single"}}
97 97  {{layout-cell}}
98 -== The discontinued (till 2021) FLASH HDF5 structure ==
118 +== Complete list of recordable parameters ==
99 99  
100 -The photon diagnostic, electron diagnostic and beamline information as well as the information about the pump-probe laser and the infrastructure offered for users (GHz/MHz ADCs) can be included in one HDF5 file which is organized according to train IDs. The general structure is:
101 -
102 -* Electron Diagnostic
103 -* Photon Diagnostics
104 -* Beamlines
105 -* Experiment
106 -* Timing
107 -[[image:attach:HDF5_structure.jpg||height="400"]]
108 -
109 -A detailed description of (most) channels can be found in the lower part of the hdf5 viewer:
110 -[[image:attach:HDF5_structure_desc.jpg||thumbnail="true" height="250"]]
111 -
112 -[[Contents>>doc:||anchor="Contents"]]
113 -
120 +The complete list for the relation between DOOCS names and HDF5 names for the recordable parameters can be found in [[DESY's Repository~[~[image:url:http://hasfweb.desy.de/pub/TWiki/TWikiDocGraphics/external-link.gif~|~|width="13" height="12"~]~]>>url:https://stash.desy.de/projects/CS/repos/pah/browse/src/camp/data/channel2HdfName.dat||shape="rect"]].
114 114  \\
115 115  
116 -=== HDF5 example files (old format) ===
117 -
118 -Here we have a few HDF5 samples (User data combined with Photon diagnostics data) from a few beamtimes showing the different kind options.
119 -
120 -[[image:attach:image2019-10-21_17-2-50.png||thumbnail="true" width="300"]] [[download HDF5 (Images @ FL2)>>url:https://desycloud.desy.de/index.php/s/nyEgeCWJFC4gao2||shape="rect"]]
121 -
122 -\\
123 -
124 -[[~[~[image:attach:image2019-10-22_10-52-27.png~|~|thumbnail="true" width="300"~]~]download HDF5 (GHz ADC and OPIS @ FL2)>>url:https://desycloud.desy.de/index.php/s/AeA2kPNNnZgX95A||shape="rect"]]
125 -
126 -\\
127 -
128 -\\
129 -
130 -[[Contents>>doc:||anchor="Contents"]]
131 -
132 -\\
133 -
134 134  == Most popular FLASH parameters and their names in HDF5, DOOCS and (raw) DAQ ==
135 135  
136 136  {{id name="DOOCSparameters"/}}
137 137  
138 -The complete list for the relation between DOOCS names and HDF5 names for the recordable parameters can be found in [[DESY's Repository~[~[image:url:http://hasfweb.desy.de/pub/TWiki/TWikiDocGraphics/external-link.gif~|~|width="13" height="12"~]~]>>url:https://stash.desy.de/projects/CS/repos/pah/browse/src/camp/data/channel2HdfName.dat||shape="rect"]].
139 -The most common and often used ones are summarized below:
140 -
141 141  Note, the HDF group and data set names apply to our HDF tree version since vers. 0.3.0.
142 142  
143 143  \\
... ... @@ -144,8 +144,6 @@
144 144  
145 145  === FLASH1 ===
146 146  
147 -\\
148 -
149 149  ==== Beamline info (FLASH1) ====
150 150  
151 151  {{code language="none"}}/FL1/Beamlines/Attenuator/pressure{{/code}}
... ... @@ -222,8 +222,7 @@
222 222  
223 223  ==== Photon Diagnostics SASE ([[GMD>>url:http://photon-science.desy.de/facilities/flash/photon_diagnostics/gmd_intensity_and_position/index_eng.html||shape="rect"]]) ====
224 224  
225 -\\
226 -
209 +{{expand title="Discontinued GMD format (used until 2021)"}}
227 227  (% style="color: rgb(0,0,0);" %)**Discontinued GMD data recording / evaluation  (VME + PhotonFlux ML server)**
228 228  
229 229  {{code language="none"}}/FL1/Photon Diagnostic/GMD/Average energy/energy tunnel{{/code}}
... ... @@ -271,10 +271,11 @@
271 271  DAQ channel: {{code language="none"}}PBD.PHFLUX/BDA.ENERGYPULSE.FF{{/code}}
272 272  desc :Energy per pulse BDA (from e-) - uncorrected values. There are also values saved if there was no beam ... just background noise
273 273  units : a.u. (more or less µJ but need to be calibrated with the "Average energy" for good precision)** [[see here for help>>doc:FLASHUSER.jddd-linked help pages.Calibrating the pulse resolved (electron) data from GMD.WebHome]]**
257 +{{/expand}}
274 274  
275 275  \\
276 276  
277 -(% style="color: rgb(0,0,0);" %)**NEW (2021) GMD data recording / evaluation  (MTCA, analog to FLASH2 and XFEL)**
261 +(% style="letter-spacing: 0px; color: rgb(0, 0, 0)" %)**NEW (since 2021) GMD data recording / evaluation  (same format as FLASH2 and XFEL)**
278 278  
279 279  {{code language="none"}}/FL1/Photon Diagnostic/GMD/Average energy/energy tunnel{{/code}}
280 280  //always saved (PBD)//
... ... @@ -451,7 +451,7 @@
451 451  {{info title="BAM hints"}}
452 452  * Besides the arrival time from FLASH1, the FLASH2/3 electron arrival time is also saved. In case of doubt, ask your local contact.
453 453  * [[Link to a collection of papers related to the BAM and the analysis of pump-probe experiments>>doc:FLASHUSER.Additional helpful things.FLASH beamlines and instruments references.WebHome]]
454 -* [[LINK to detailed infos from MSK (may only work inside DESY network~[~[image:url:http://hasfweb.desy.de/pub/TWiki/TWikiDocGraphics/external-link.gif~|~|width="13" height="12"~]~]>>url:http://www.desy.de/~~mbock/pages/BAM_daq_channel_descriptions.html||shape="rect"]]
438 +* [[LINK to detailed infos from MSK>>url:https://confluence.desy.de/display/SDiagPublic/BAM+Data+Structure||shape="rect"]]
455 455  * a recent  [[talk about the working principle of the BAM>>attach:BAM-basics and outlook-2018_DESY-template_16-9Format.pdf]]
456 456  {{/info}}
457 457  
... ... @@ -550,7 +550,7 @@
550 550  //always saved (PBD)//
551 551  DOOCS prop : {{code language="none"}}TTF2.DIAG/PBD.TOROID.ML/12EXP/CHARGE.TD{{/code}}
552 552  DAQ channel: {{code language="none"}}TTF2.DIAG/PBD.TOROID.ML/12EXP{{/code}}
553 -desc: The bunch pattern as function of time in a burst recorded by toroide diagnostic BEHIND the undulator. (FLASH1)
537 +desc: The bunch pattern as function of time in a burst recorded by toroid diagnostic BEHIND the undulator. (FLASH1)
554 554  units:
555 555  \\
556 556  
... ... @@ -623,7 +623,7 @@
623 623  //always saved (PBD)//
624 624  DOOCS prop : {{code language="none"}}TTF2.FEL/TDOLFEL/TDOLFEL/STREAK.CAM.TIME{{/code}}
625 625  DOOCS prop : {{code language="none"}}TTF2.FEL/TDOLFEL/TDOLFEL/STREAK.CAM.TIME{{/code}}
626 -desc: delaytime between the optical laser and the FEL units: ps
610 +desc: delay time between the optical laser and the FEL units: ps
627 627  
628 628  \\
629 629  
... ... @@ -633,7 +633,7 @@
633 633  
634 634  ==== User Data (FLASH1) ====
635 635  
636 -The data saved specifically for detectors at an experiment will show up in /Experiment/ there is a large number of options for cameras or monitoring pslow properties (motor positons etc) for user experiments. For details please ask your local contact.
620 +The data saved specifically for detectors at an experiment will show up in /Experiment/. There is a large number of options for cameras or for monitoring slow properties (motor positions etc.) for user experiments. For details please ask your local contact.
637 637  
638 638  NOTE: If parameters for an experiment are included on short notice, the correct naming in the HDF5 file may not be available in time, and the data will show up in /uncategorized/ under their DOOCS names.
639 639  
... ... @@ -661,13 +661,13 @@
661 661  {{code language="none"}}/FL1/Experiment/BL3/ADQ412 GHz ADC/CH03/TD{{/code}}
662 662  
663 663  DOOCS prop : {{code language="none"}}FLASH.FEL/ADC.ADQ.PG/EXP1.CH00/CH00.TD  or CH00.DAQ.TD{{/code}}
664 -here the {{code language="none"}}CH00.TD{{/code}} is the full ADC trace as it is sampled ( typically several 100.000 samples per pulse train) while the {{code language="none"}}CH00.DAQ.TD{{/code}} trace only has the number of samples which are sent to the DAQ OR if //grouping// is activated the {{code language="none"}}CH00.DAQ.TD{{/code}} conatins only the grouped spectra. To read the ADC trace with an online analysis program the {{code language="none"}}CH00.DAQ.TD{{/code}} is used preferablly.
648 +Here, {{code language="none"}}CH00.TD{{/code}} is the full ADC trace as it is sampled (typically several 100,000 samples per pulse train), while the {{code language="none"}}CH00.DAQ.TD{{/code}} trace only has the number of samples which are sent to the DAQ or, if //grouping// is activated, {{code language="none"}}CH00.DAQ.TD{{/code}} contains only the grouped spectra. To read the ADC trace with an online analysis program, {{code language="none"}}CH00.DAQ.TD{{/code}} is preferred.
665 665  DAQ channel: {{code language="none"}}FLASH.FEL/ADC.ADQ.PG/EXP1.CH00{{/code}}
666 666  
667 667  Additional parameters are also saved, such as:
668 668  
669 669  * {{code language="none"}}sample frequency{{/code}}: the sample frequency in MHz (number of samples per µs). NOTE: the clock of the ADC is NOT synchronized to the FLASH timing system. Thus the number of samples between bunches in the bunch train may not be an integer, which will show up for long bunch trains.
670 -* {{code language="none"}}number of samples{{/code}}: total number of samoles recorded for each 10 Hz trigger
654 +* {{code language="none"}}number of samples{{/code}}: total number of samples recorded for each 10 Hz trigger
671 671  * {{code language="none"}}error (ADC):{{/code}} 0 indicates that there was no error
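The note on the unsynchronized ADC clock can be made concrete with a short sketch. It assumes a hypothetical sample frequency of 2000.5 MHz and a bunch spacing of 1 µs (both numbers are invented for illustration, as are the function names) and compares two ways of locating the bunches in the ADC trace: rounding the exact position of every bunch, and stepping with a fixed integer number of samples.

```python
import math


def round_half_up(x: float) -> int:
    """Round to the nearest integer, with ties going up."""
    return math.floor(x + 0.5)


def bunch_sample_positions(
    sample_freq_mhz: float, bunch_spacing_us: float, n_bunches: int, first_sample: int = 0
) -> list:
    """Sample index of every bunch, rounding each exact position separately."""
    # MHz times µs gives the (possibly non-integer) number of samples per bunch.
    samples_per_bunch = sample_freq_mhz * bunch_spacing_us
    return [
        round_half_up(first_sample + i * samples_per_bunch) for i in range(n_bunches)
    ]


def fixed_step_positions(
    sample_freq_mhz: float, bunch_spacing_us: float, n_bunches: int, first_sample: int = 0
) -> list:
    """Sample index of every bunch using one rounded, constant integer step."""
    step = round_half_up(sample_freq_mhz * bunch_spacing_us)
    return [first_sample + i * step for i in range(n_bunches)]


# Hypothetical values: 2000.5 samples per µs, bunches 1 µs apart.
exact = bunch_sample_positions(2000.5, 1.0, 5)
fixed = fixed_step_positions(2000.5, 1.0, 5)
# Offset (in samples) accumulated by the fixed-step approach.
drift = [f - e for f, e in zip(fixed, exact)]
```

With these assumed numbers the fixed-step positions are already two samples off at the fifth bunch, and the offset keeps growing along the train, which is why the recorded sample frequency should be used when slicing long bunch trains.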
672 672  
673 673  ===== MHz ADCs =====
... ... @@ -680,7 +680,7 @@
680 680  Additional parameters are also saved, such as:
681 681  
682 682  * {{code language="none"}}sample frequency{{/code}}: the sample frequency in MHz (number of samples per µs). NOTE: the clock of the ADC is NOT synchronized to the FLASH timing system. Thus the number of samples between bunches in the bunch train may not be an integer, which will show up for long bunch trains.
683 -* {{code language="none"}}number of samples{{/code}}: total number of samoles recorded for each 10 Hz trigger
667 +* {{code language="none"}}number of samples{{/code}}: total number of samples recorded for each 10 Hz trigger
684 684  
685 685  [[Contents>>doc:||anchor="Contents"]]
686 686  
... ... @@ -841,7 +841,7 @@
841 841  // saved upon request (PBD2)//
842 842  DOOCS prop : {{code language="none"}}FLASH.UTIL/STORE/FL2.TUNNEL.OPIS/VAL040{{/code}}
843 843  DAQ channel:{{code language="none"}} FLASH.UTIL/STORE/FL2.TUNNEL.OPIS/VAL040{{/code}}
844 -desc : meanwavelength ( ~~ 1 sec averaging time ) measured in the TUNNEL for a specific bunch out of the bunch train (via photoelectron spectroscopy)
828 +desc : mean wavelength ( ~~ 1 sec averaging time ) measured in the TUNNEL for a specific bunch out of the bunch train (via photoelectron spectroscopy)
845 845  units : nm
846 846  
847 847  \\
... ... @@ -855,11 +855,11 @@
855 855  
856 856  \\
857 857  
858 -If Opis is running typically on the the averaged data is saved. For several experiments it may make sense to save the information for each single bunch. This is up to now done by savng the compleate ADC trace of the TOF setup. This is a huge amount of data and needs processing. This has to be performed after the beamtime in close contact to [[Markus Braune>>mailto:markus.braune@desy.de||shape="rect"]] ( respobsible for [[OPIS>>url:http://photon-science.desy.de/facilities/flash/photon_diagnostics/opis_spectrometer/index_eng.html||shape="rect"]])
842 +If OPIS is running, typically only the averaged data is saved. For several experiments it may make sense to save the information for each single bunch. Up to now this is done by saving the complete ADC trace of the TOF setup. This is a huge amount of data and needs processing. This has to be performed after the beamtime in close contact with [[Markus Braune>>mailto:markus.braune@desy.de||shape="rect"]] (responsible for [[OPIS>>url:http://photon-science.desy.de/facilities/flash/photon_diagnostics/opis_spectrometer/index_eng.html||shape="rect"]]).
859 859  
860 860  \\
861 861  
862 -In case OPIS was not operating there is still informaton about the **set wavelength** for the undulators (see below) which may differ by up to 5 % from the actual wavelength due to different settings in the FEL ...
846 +In case OPIS was not operating there is still information about the **set wavelength** for the undulators (see below) which may differ by up to 5 % from the actual wavelength due to different settings in the FEL ...
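As a quick arithmetic illustration of the 5 % statement, the following sketch computes the range in which the actual wavelength can be expected for a given set wavelength. The function name `wavelength_window` and the example value of 20 nm are assumptions made for this illustration only.

```python
def wavelength_window(set_wavelength_nm: float, max_rel_dev: float = 0.05):
    """Return (lower, upper) bounds in nm around the set wavelength."""
    if set_wavelength_nm <= 0:
        raise ValueError("set wavelength must be positive")
    # Maximum absolute deviation for the given relative deviation.
    delta = set_wavelength_nm * max_rel_dev
    return set_wavelength_nm - delta, set_wavelength_nm + delta


# Hypothetical set wavelength of 20 nm: actual value expected within about 19 to 21 nm.
lower_nm, upper_nm = wavelength_window(20.0)
```

A window of this kind can serve as a plausibility check when only the set wavelength is available, but it does not replace a measurement with OPIS.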
863 863  
864 864  \\
865 865  
... ... @@ -878,8 +878,21 @@
878 878  
879 879  \\
880 880  
881 -===== undulator settings =====
865 +===== electron bunch energy =====
882 882  
867 +{{code language="none"}}/FL2/Electron Diagnostic/Electron energy/energy of first bunch/behind undulators{{/code}}
868 +//always saved (PBD2)//
869 +DOOCS prop : {{code language="none"}}FLASH.DIAG/BEAM_ENERGY_MEASUREMENT/FL2XTDS/ENERGY.FLASH2{{/code}}
870 +DAQ channel: (% style="color: rgb(94,108,132);" %)TTF2.DAQ/PBD2.BEAM.ENERGY.MEAS.ML.COPY/FL2XTDS.ENERGY.FLASH2{{code language="none"}}{{/code}}(%%)
871 +desc: electron bunch energy measured behind the undulator. Data is saved with 10 Hz - BUT (for computation reasons) only the energy of the FIRST bunch is recorded. The data is also available for (% style="color: rgb(94,108,132);" %)extraction and septum  in the beginning of FLASH2
872 +
873 +(% style="letter-spacing: 0.0px;" %)units: (% class="twikiNewLink" %)MeV
874 +
875 +\\
876 +
877 +(% style="color: rgb(94,108,132);font-weight: 600;letter-spacing: 0.0px;" %)
878 +undulator settings
879 +
883 883  {{code language="none"}}/FL2/Electron Diagnostic/Undulator setting/set wavelength{{/code}}
884 884  //always saved (PBD2)//
885 885  DOOCS prop : {{code language="none"}}TTF2.FEEDBACK/FL2.WAVELENGTHCONTROL/FLASH2/WAVELENGTH{{/code}}
... ... @@ -1107,6 +1107,7 @@
1107 1107  
1108 1108  \\
1109 1109  
1107 +(% class="wrapped" %)
1110 1110  |=(((
1111 1111  earlier HDF path (vers. 0.2)
1112 1112  )))|=(((
... ... @@ -1208,14 +1208,6 @@
1208 1208  [[Contents>>doc:||anchor="Contents"]]
1209 1209  
1210 1210  \\
1211 -
1212 -== HDF5 and DOOCS ==
1213 -
1214 -Here is an outdated [[list with the available properties that are always saved (PBD) for FLASH1 as>>attach:FLASH1__DaqChannel2HdfNamePbd.xlsx]] HDF5 names and the corresponding DOOCS names
1215 -
1216 -[[Contents>>doc:||anchor="Contents"]]
1217 -
1218 -\\
1219 1219  {{/layout-cell}}
1220 1220  {{/layout-section}}
1221 1221  {{/layout}}