|
|
||
|
FEP - Format Use by a Researcher - Eduardo Santiago - UDF |
|
|
|
||
| Comment on this template in the HyperNews Discussion. |
1. Format (Format System) Identification
UDF
2. Original MotivationThe IMAGE mission distributes level 1 data in UDF. UDF has been unsatisfactory as a data format. It violates nearly every single one of my requirements for a file format (posted elsewhere):
3. Data TypesProcessing Level: Level 1. Object Types: Time Series, Multidimensional, Spectra.
4. SupportAn enormous amount of support has been necessary so far. I have been in constant contact with the UDF developer (UDF is pretty much a one-man show). The developer does provide prompt support: from responding to bug reports, to analyzing my code snippets and describing where my thinking has gone awry of the UDF model, he always seems to be available. However, one shouldn't need full-time access to a developer, just to read data. Even with constant support, "uh-oh" issues arise almost weekly. For instance, when I was unable to reposition the data pointer on one dataset, I was informed that I had to use the special routine ToThisTime(). That was the first I'd heard of it, and it wasn't clear why the existing file_pos() function didn't perform that function. Finally, it isn't always possible to resolve issues. On various occasions, I have reported bugs to the developer (along with copious documentation explaining why it's a bug, how to reproduce it, even how to fix it), and the bugs still remain.
5. SoftwareI have developed, from scratch, an IDL package that provides a simple open()/read()/close() interface to UDF. Working on the assumption that researchers just want to get their hands on their data, I have simplified UDF as much as is humanly possible. The complexity of the underlying code -- just for reading data, using UDF library calls -- is overwhelming: over four thousand lines of code (C and Perl) are required to provide this interface.
6. EnvironmentUNIX-only (Linux, Solaris) with GNU tools.
7. UsageSince we're given data in UDF, and duplicate storage costs are pretty high, we do keep the UDF files and read from them. All subsequent Level 2 (and beyond) products are written using HDF, of course.
8. Experience>Relative to its ability to carry and manage research-needed metadata This seems to be a -- possibly the -- fundamental rationale behind UDF. I don't know if or how it will succeed in this goal. Even if it performs as claimed, the cost is prohibitive. I've seen other missions accomplish this with far less effort. >Relative to its related software UDF does do something that no other format does: convert packed telemetry bits to "normal" quantities. However, IMnsHO, this should be done by ground station software which then saves and distributes data files in a common, robust format.
9. Desired FunctionalityNone. Although the idea behind UDF is a tempting one, there are too many problems with the implementation itself. A complete redesign and reimplementation, from ground zero is necessary to make it useful.
10. Selection CriteriaSee my comments elsewhere.
11. Impact on ResearchSee my comments elsewhere. For the most part, I like to accept data in whatever form the distributor likes (as long as it's well documented). If it's not in a useful form, I convert it to HDF or CDF (via automated scripts), and forevermore ignore the source data files. This has worked successfully so far, requiring only a few days' effort (at most) to be able to handle a new mission data format. After this, no thought is ever again given to data format, since generic wrapper functions are used to access data. With UDF, I've spent over six months getting things working. This unnecessarily hampered my productivity on other projects.
12. Other CommentsIn writing these comments, I have succeeded in alienating and infuriating a large number of people. Such is life. My intent is not to belittle or insult the UDF developer: he seems to be a terrific fellow, provides quite good support, and is really trying hard to make UDF work. However, that doesn't excuse UDF. UDF can dazzle with its promise of quick access to data, and instant-gratification pretty pictures. But any attempt to do anything more with it will immediately show its weaknesses.
|
|
|
Comment on this template in the
HyperNews Discussion.
|
||
|
|
||
Wider ViewsFormats Evolution Process (FEP) Discussion Forums PageFormats Evolution Process (FEP) Home Page NASA/Science Office of Standards and Technology (NOST) Home Page
|
||
|
|
||
|
URL: http://ssdoo.gsfc.nasa.gov/nost/fep/researcher-Eduardo Santiago-UDF.html
A service of
NOST at
NSSDC.
Author: Eduardo Santiago / Los Alamos National Laboratory / IMAGE/MENA (esm@lanl.gov) 505/665-3130
|
||
|
|
||