-
Notifications
You must be signed in to change notification settings - Fork 6
/
01-eml-metadata.Rmd
2402 lines (2089 loc) · 97.1 KB
/
01-eml-metadata.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
# EML Best Practices
This document contains current 'Best Practice' recommendations for EML content for metadata related to ecological and environmental data. It is intended to augment the EML schema documentation [@EML_2019] for a less-technical audience. This is one component of several resources available to EML preparers. These recommendations are directed towards the following goals:
- Provide guidance and clarification in the implementation of EML for datasets
- Minimize heterogeneity of EML documents to simplify development and re-use of software built to ingest it
- Maximize interoperability of EML documents to facilitate data synthesis
## Introduction {-}
EML Best Practice recommendations have evolved over time. The most active contributors have been members of the LTER Information Managers Committee in multiple working groups and workshops. EML has been widely used for several years with multiple applications written against it, and the community has had the opportunity to observe the consequences of many content patterns. As much as possible, recommendations have been aligned with those experiences, as well as with the capability of data contributors.
Timeline and Previous Revisions
- 2017 Best Practices for Dataset Metadata in EML v3 (this document)
- 2016 EDI inception, see http://edirepository.org
- 2011 EML Best Practices for LTER sites v2
- 2008 EML 2.1 release
- 2004 EML Best Practices for LTER sites
- 2003 LTER adopts EML as network exchange standard
Contributors, including LTER EML Best Practices Working Groups and workshops in 2003, 2004, 2010 (alphabetical order):
- Dan Bahauddin
- Barbara Benson
- Emery Boose
- James Brunt
- Duane Costa
- Corinna Gries
- Don Henshaw
- Margaret O'Brien
- Ken Ramsey
- Inigo San Gil
- Mark Servilla
- Wade Sheldon
- Philip Tarrant
- Theresa Valentine
- John Vande Castle,
- Kristin Vanderbilt
- Jonathan Walsh
- Yang Xia
## Conventions and Definitions {-}
### Audience
This document is intended for data managers. It assumes that readers are familiar with
- the basic structure of an XML document, and the ability to edit in an XML
editor like OxygenXML or XMLSpy.
- the process for contributing data to a repository. If you reached this
document from a repository's help-page, contact them for more information.
### Fonts and typeface
Numbered examples of EML nodes are in fixed-width font:
```xml
<?xml version="1.0" encoding="UTF-8"?>
```
XML element and attribute names, XPath and references to element names
in text are in bold face. Single element names are surrounded by angle
brackets, as they appear in XML.
<**dataTable**>
**/eml:eml/\@packageId**
Some recommendations have special context, e.g., an XML element or
attribute may be requested by a community (e.g., LTER), or required by
the EDI repository (but not by other repositories).
_Context notes: Recommendations for EML usage in a specific context are
called "context notes", and are placed in separate paragraphs, in
italic._
### Definitions
<dl>
<dt>EML preparer</dt>
<dd>the person responsible for "building" the EML metadata
record. Generally, this is a data manager working with a project or
physical site that produces data.</dd>
<dt>Contributor</dt>
<dd>the research project contributing the data package, e.g.,
an LTER or OBFS site, or a Macrosystems project. Generally, the "EML
preparer" works with or for the "Contributor."</dd>
<dt>Data package</dt>
<dd>the EML metadata together with its entity or entities.
This is generally the unit housed in repositories. We use this term to
avoid confusion with the EML element "<b>dataset</b>".</dd>
</dl>
### Other EML Resources
Some sections refer to further information or tools. These can be found
on the EDI website, under "Resources", at
[https://edirepository.org](https://edirepository.org)
## General Recommendations {-}
Following are general best practices for handling EML dataset metadata:
### Metadata Distribution
Do not publicly distribute EML documents containing elements with incorrect information, e.g., as a workaround for missing metadata or to meet validation requirements. Pre-publication drafts, or EML produced for demonstration or testing purposes should be clearly identified as such and not contributed to public archives, because these are passed on to large-scale clearinghouses. For previews of drafts or handling test and demonstration data packages, consult your repository to learn about options.
### Data Package Identifiers
Metadata and data set versioning are controlled by the contributor, and so identifiers are tied to local systems. Many repository systems that accept EML-described data support principles of immutable metadata and data entity versioning. EML has elements to contain package identifiers, although these may also be assigned externally. It is the responsibility of the submitters to understand the practices of their intended repository when using identifiers.
### High-priority Elements
- To support locating data by time, geographic location, and taxonomically, metadata should provide as much information as possible for the data package, in the three <**coverage**>; elements:
- <**temporalCoverage**>; (when),
- <**geographicCoverage**>; (where) and
- <**taxonomicCoverage**> (what).
- For a potential user to evaluate the relevance and usability of the data package for their research study or synthesis projects, metadata should include detailed descriptions in the
- <**project**>,
- <**methods**>,
- <**protocols**>, and
- <**intellectualRights**> elements.
## The root element: \<eml:eml> {#Root_element .unnumbered}
This element is the root element in all EML documents. The XPath
notation is:
**/eml:eml**
The root element holds two important parts, both of which are optional,
but recommended.
### \@schemaLocation (XML attribute)
This attribute is this location (XPath):
**/eml:eml/\@schemaLocation**
The schemaLocation attribute tells a processor the name of the schema to
which the EML document belongs and where to find it. Most repositories
check schema compliance when data packages are deposited, but it is
highly recommended that data managers know how and where to specify the
schema that their metadata document should adhere to. This way, they can
validate their own work in progress, e.g., through an XML editor like
OxygenXML.
### \@packageId (XML attribute)
This attribute is found at this location (XPath):
**/eml:eml/\@packageId**
As outlined elsewhere, EML preparers should manage unique identifiers
and versioning at the local level (see **\@system** discussion below).
The **packageId** attribute can be used to contain the same identifier
as is used by the repository.
_Context Note: The packageID attribute is required in all EML documents
submitted to the EDI repository. It is entered into the repository software,
and theformat is standardized to three parts: scope, package-number, revision.
The scope should be "edi" unless another scope is justified by prior
arrangement. See Example 1._
Example 1: attributes packageId, id, system, and scope
```xml
<?xml version="1.0" encoding="UTF-8"?>
<eml:eml xmlns:ds="eml://ecoinformatics.org/dataset-2.1.0"
xmlns:xs="http://www.w3.org/2001/XMLSchema"
xmlns:eml="eml://ecoinformatics.org/eml-2.1.0"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xmlns:stmml="http://www.xml-cml.org/schema/stmml"
xsi:schemaLocation="eml://ecoinformatics.org/eml-2.1.0
https://nis.lternet.edu/eml-2.1.0/eml.xsd"
packageId="knb-lter-fls.21.3"
system="FLS"
scope="system">
```
## Top Level Elements {-}
An EML **dataset** is composed of up to three elements under the root
element (<**eml:eml**>):
<**access**> More information: [access](#access)
<**dataset**> More information: [dataset](#dataset)
<**additionalMetadata**>More information: [additionalMetadata](#additionalMetadata)
### access {#access}
The access element is found at this location (XPath):
**/eml:eml/access**
**/eml:eml/\[entityType\]/physical/distribution/access**
<**access**> contains a list of rules defining permissions for this
metadata record and its data entity. Values must be applicable by the
system where data is stored. Many repositories follow the KNB system of
using access control format that conforms to the LDAP "distinguishedName
(dn)" for an individual, as in
"uid=FLS,o=LTER,dc=ecoinformatics,dc=org".
As of EML 2.1.0, <**access**> trees are allowed at two places: as the
first child of the <**eml:eml**> root element (a sibling to
<**dataset**>) for controlling access to the entire document, and in a
**physical/distribution** tree for controlling access to the resource
URL. With the exception of certain sensitive information, metadata
should be publicly accessible. The <**access**> element is optional,
and if omitted, the repository may presume that only the dataset
submitter will be allowed access.
Example 2: access
```xml
<access authSystem="knb" order="allowFirst" scope="document">
<allow>
<principal>uid=FLS,o=lter,dc=ecoinformatics,dc=org</principal>
<permission>all</permission>
</allow>
<allow>
<principal>public</principal>
<permission>read</permission>
</allow>
</access>
```
### dataset {#dataset}
This element is found at these locations (XPath):
**/eml:eml/dataset**
Under <**dataset**>, the following elements are available. Some are
optional, but if they appear, this order is enforced by the schema.
Generally, the recommendations are presented here in this order, with
the exception of elements related to people and organizations which are
grouped together so that the distinctions between the uses of those
elements are clear. Elements that can appear at different levels within
an EML file are discussed at their first appearance, or highest level.
<**alternateIdentifer**>
<**shortName**>
<**title**>
<**creator**>
<**metadataProvider**>
<**associatedParty**>
<**pubDate**>
<**language**>
<**series**>
<**abstract**>
<**keywordSet**>
<**additionalInfo**>
<**intellectualRights**>
<**distribution**>
<**coverage**>
<**purpose** >
<**maintenance**>
<**contact**>
<**publisher**>
<**pubPlace**>
<**project**>
These elements are then followed by one or more elements for the data
entity (or entities), designated by choosing:
[ **dataTable** | **spatialRaster** | **spatialVector** |
**storedProcedure** | **view** | **otherEntity** ]
#### alternateIdentifier
The alternateIdentifier element is found at this location (XPath):
**/eml:eml/ dataset/alternateIdentifier**
**/eml:eml/ dataset/[entity]/alternateIdentifier**
The contributing organization's local data set identifier should be
listed as the EML <**alternateIdentifier**>, particularly when it
differs from the "**packageId**" attribute in the <**eml:eml**>
element. The <**alternateIdentifier**> should also be used to denote
that a package belongs to more than contributing organization by
including each individual ID in a separate <**alternateIdentifier**>
tag. At the entity level, the <**alternateIdentifier**> should contain
an alternate name for the data table (or other entity) itself (see
additional comments under entities, below.)
#### title (dataset)
The dataset title element is found at this location (XPath):
**/eml:eml/ dataset/title**
**/eml:eml/method/methodStep/protocol/title**
**/eml:eml/project/title**
The dataset <**title**> should be descriptive and should mention the
data collected, geographic context, research site, and time frame (what,
where, and when).
Example 3: dataset, alternateIdentifier, shortName, title
```xml
<dataset id="FLS-1" system="FLS" scope = "system">
<alternateIdentifier>FLS-1</alternateIdentifier>
<shortName>Arthropods</shortName>
<title>Long-term Ground Arthropod Monitoring Dataset at Ficity, USA
from 1998 to 2003</title>
```
### additionalMetadata {#additionalMetadata}
This element tree is found at (XPath):
**eml:eml/additionalMetadata**
<**additionalMetadata**> is a flexible field for including any other
relevant metadata that pertains to the resource being described. Its
content must be valid XML. A unit as a <**customUnit**> must be
described in this tree.
<**describes**> (optional) is a pointer to an "id" attribute on an EML
element ("id" described in another area). This pointer must be identical
to the attribute it is pointing at, so that automated processes are able
to associate <**additionalMetadata**> to the described attribute. If
the <**describes**> element is omitted, it is assumed that the
<**additionalMetadata**> content applies to the entire EML document.
<**metadata**> contains the additional metadata to be included in the
document. The contents can be any valid XML. This element should be used
for extending EML to include metadata that is not already available in
another part of the EML specification, or to include site- or
system-specific extensions that are needed beyond the core metadata. The
additional metadata contained in this field describes the element
referenced in the <**describes**> element preceding it. If
<**describes**> is not used, either <**metadata**> must contain
sufficient information to define the association between
<**additionalMetadata**> or the <**additionalMetadata**> can be
presumed to apply to the entire data package.
An example of "sufficient information to define the association" is the
definition of a <**customUnit**>. The EML Parser expects to find the
description of a <**customUnit**> in the id attribute of a
<**unit**> element in a <**unitList**>, i.e., at
/eml:eml/additionalMetadata/metadata/unitList/unit. For example,
`"stmml:unit id="siemensPerMeter"` points at the <**customUnit**>
`"siemensPerMeter"`. The EML Parser is available from GitHub, with the EML
project. For descriptions of custom units see "Other Resources".
Example 25: additionalMetadata custom unit
```xml
<additionalMetadata>
<metadata>
<stmml:unitList>
<stmml:unit id="siemensPerMeter" name="siemensPerMeter" abbreviation="S/m"
unitType="conductance" parentSI="siemen" multiplierToSI="1" constantToSI="0">
<stmml:description>conductivity unit</stmml:description>
</stmml:unit>
</stmml:unitList>
</metadata>
</additionalMetadata>
```
## title (dataset) {-}
The dataset title element is found at this location (XPath):
**/eml:eml/ dataset/title**
**/eml:eml/method/methodStep/protocol/title**
**/eml:eml/project/title**
The dataset <**title**> should be descriptive and should mention the
data collected, geographic context, research site, and time frame (what,
where, and when).
Example 3: dataset, alternateIdentifier, shortName, title
```xml
<dataset id="FLS-1" system="FLS" scope = "system">
<alternateIdentifier>FLS-1</alternateIdentifier>
<shortName>Arthropods</shortName>
<title>Long-term Ground Arthropod Monitoring Dataset at Ficity, USA
from 1998 to 2003</title>
```
## People and Organizations (Parties) {-}
People and organizations are all described using a "ResponsibleParty"
group of elements, which is found at these locations (XPath):
**/eml:eml/dataset/creator**
**/eml:eml/dataset/contact**
**/eml:eml/dataset/metadataProvider**
**/eml:eml/dataset/associatedParty**
**/eml:eml/dataset/publisher**
**/eml:eml/dataset/project/creator**
**/eml:eml/dataset/method/methodStep/protocol/creator**
**General recommendations**: When using <**individualName**>
elements anywhere within an EML document, names should be constructed
with English alphabetization in mind. Many sites have found that
maintaining full contact information for every creator is impractical,
however a few important contact information should be kept up to date
(see below). If a name includes a suffix, it should be included in the
<**surName**> element after the last name.
It is recommended to include complete contact information for a
permanent role that is independent of the person holding that position.
For example, for an information manager, site contact, pay careful
attention to phone number and use an e-mail alias that can be passed on.
(See below, under<**contact**>.)
With the advent of general identifiers such as ORCIDs, the text in the
<**address**>, <**phone**>, and <**onlineURL**> elements may
become unnecessary for individuals and so is optional if and an
individual's ORCID is included. <**electronicMailAddress**> is
recommended to simplify contacting responsible parties. See the
<**userId**> field. ORCID identifiers are not yet available for
organizations, so <**address**>, <**phone**>, and <**onlineURL**>
elements should be included for them. In the examples, these elements
are included for completeness.
### userId
This element is found at this location (XPath):
**/eml:eml/dataset/creator/userId**
**/eml:eml/dataset/contact/userId**
**/eml:eml/dataset/metadataProvider/userId**
**/eml:eml/dataset/associatedParty/userId**
**/eml:eml/dataset/publisher/userId**
**/eml:eml/dataset/project/creator/userId**
**/eml:eml/dataset/method/methodStep/protocol/creator/userId**
The optional <**userId**> field holds identifiers for responsible
parties from other systems. This element is repeatable so that multiple
systems can be referenced. EML prepares should contact the system they
plan to use to learn their preferences for inclusion in metadata. The
examples here are for ORCID identifiers, and that organization has asked
that its full URI be used as both the **system** attribute, and as the
head of the identifier itself.
Example 4: creator
```xml
<creator id="org-1" system="FLS" scope="system">
<organizationName>Fictitious LTER Site</organizationName>
<address>
<deliveryPoint>Department for Ecology</deliveryPoint>
<deliveryPoint>Fictitious State University</deliveryPoint>
<deliveryPoint>PO Box 111111</deliveryPoint>
<city>Ficity</city>
<administrativeArea>FI</administrativeArea>
<postalCode>11111-1111</postalCode>
</address>
<phone phonetype="voice">(999) 999-9999</phone>
<electronicMailAddress>[email protected]</electronicMailAddress>
<onlineUrl>http://www.fsu.edu/</onlineUrl>
<userId system="https://orcid.org">
https://orcid.org/0000-0000-0000-0000
</userId>
</creator>
<creator id="pos-1" system="FLS" scope="system">
<positionName>FLS Lead PI</positionName>
<address>
<deliveryPoint>Department for Ecology</deliveryPoint>
<deliveryPoint>Fictitious State University</deliveryPoint>
<deliveryPoint>PO Box 111111</deliveryPoint>
<city>Ficity</city>
<administrativeArea>FI</administrativeArea>
<postalCode>11111-1111</postalCode>
</address>
<phone phonetype="voice">(999) 999-9999</phone>
<electronicMailAddress>[email protected]</electronicMailAddress>
<onlineUrl>http://www.fsu.edu/</onlineUrl>
<userId system="https://orcid.org">
https://orcid.org/0000-0000-0000-0000
</userId>
</creator>
<creator id="pers-1" system="FLS" scope="system">
<individualName>
<salutation>Dr.</salutation>
<givenName>Joe</givenName>
<givenName>T.</givenName>
<surName>Ecologist Jr.</surName>
</individualName>
<organizationName>FSL LTER</organizationName>
<address>
<deliveryPoint>Department for Ecology</deliveryPoint>
<deliveryPoint>Fictitious State University</deliveryPoint>
<deliveryPoint>PO Box 111111</deliveryPoint>
<city>Ficity</city>
<administrativeArea>FI</administrativeArea>
<postalCode>11111-1111</postalCode>
</address>
<phone phonetype="voice">(999) 999-9999</phone>
<electronicMailAddress>[email protected]</electronicMailAddress>
<onlineUrl>http://www.fsu.edu/~jecologist</onlineUrl>
<userId system="https://orcid.org">
https://orcid.org/0000-0000-0000-0000
</userId>
</creator>
```
### creator
This element is found at this location (XPath):
**/eml:eml/dataset/creator**
The <**creator**> is considered to be the author of the data
package, i.e. the person(s) responsible for intellectual input into its
creation. <**surName>** and <**givenName>** elements are used to
build citations, so these should be completed fully for credit to be
understandable. For long-term data, e.g., from an LTER Site, preparers
should include the organization (using the <**organizationName**>) or
current principal investigator (PI, using <**postitionName**>). It
should be kept in mind that in the past, different approaches have led
to confusion over how to best search for long-term data, and searchers
frequently default to searches using PI's last name. Therefore it is a
reasonable practice to include more creators rather than fewer, even if
it blurs the credit for long-term data.
### metadataProvider
This element is found at this location (XPath):
**/eml:eml/dataset/metadataProvider**
The <**metadataProvider**> element lists the person or organization
responsible for producing or providing the metadata content. For primary
data sets generated by LTER sites, the LTER site should typically be
listed under <**metadataProvider**> using the <**organizationName**>
element. For acquired data sets, where the <**creator**> or
<**associatedParty**> are not the same people who produced the
metadata content, the actual metadata content provider should be listed
instead (see Example below).
Example 5: metadataProvider
```xml
<metadataProvider>
<organizationName>Fictitious LTER Site</organizationName>
<address>
<deliveryPoint>Department of Ecology</deliveryPoint>
<deliveryPoint>Fictitious State University</deliveryPoint>
<deliveryPoint>PO Box 111111</deliveryPoint>
<city>Ficity</city>
<administrativeArea>FI</administrativeArea>
<postalCode>11111-1111</postalCode>
</address>
<phone phonetype="voice">(999) 999-9999</phone>
<electronicMailAddress>[email protected]</electronicMailAddress>
<onlineUrl>http://www.fsu.edu/</onlineUrl>
<userId system="https://orcid.org">
https://orcid.org/0000-0000-0000-0000
</userId>
</metadataProvider>
```
### associatedParty
This element is found at this location (XPath):
**/eml:eml/dataset/associatedParty**
List other people who were involved with the data in some way (field
technicians, students assistants, etc.) as <**associatedParty**>. All
<**associatedParty**> trees require a <**role**> element. The parent
university, institution, or agency could also be listed as an
<**associatedParty**> using <**role**> of "owner" when appropriate.
Example 6: associatedParty
```xml
<associatedParty id="12010" system="FLS" scope="system">
<individualName>
<givenName>Ima</givenName>
<surName>Testuser</surName>
</individualName>
<organizationName>FSL LTER</organizationName>
<address>
<deliveryPoint>Department for Ecology</deliveryPoint>
<deliveryPoint>Fictitious State University</deliveryPoint>
<deliveryPoint>PO Box 111111</deliveryPoint>
<city>Ficity</city>
<administrativeArea>FI</administrativeArea>
<postalCode>11111-1111</postalCode>
</address>
<phone phonetype="voice">(999) 999-9999</phone>
<electronicMailAddress>[email protected]</electronicMailAddress>
<onlineUrl>http://search.lternet.edu/directory_view.php?personid=12010&query=itestuser</onlineUrl>
<userId system="https://orcid.org">
https://orcid.org/0000-0000-0000-0000
</userId>
<role>Technician</role>
</associatedParty>
```
### contact
This element is found at this location (XPath):
**/eml:eml/dataset/contact**
A <**contact**> element is required in all EML metadata records. Full
contact information should be included for the position of data manager
or other designated contact, and should be kept current and independent
of personnel changes. If several contacts are listed (e.g. both a data
and site manager) all should be kept current. Technicians who performed
the work belong under <**associatedParty**> rather than
<**contact**>. Complete the <**address**>, <**phone**>,
<**electronicMailAddress**>, and <**onlineURL**> elements for the
<**contact**> element.
Example 7: contact
```xml
<contact>
<positionName id="pos-4">Information Manager</positionName>
<address>
<deliveryPoint>Department for Ecology</deliveryPoint>
<deliveryPoint>Fictitious State University</deliveryPoint>
<deliveryPoint>PO Box 111111</deliveryPoint>
<city>Ficity</city>
<administrativeArea>FI</administrativeArea>
<postalCode>11111-1111</postalCode>
</address>
<phone phonetype="voice">(999) 999-9999</phone>
<electronicMailAddress>[email protected]</electronicMailAddress>
<onlineUrl>http://www.fsu.edu/</onlineUrl>
<userId system="https://orcid.org">
https://orcid.org/0000-0000-0000-0000
</userId>
</contact>
```
### publisher
This element is found at this location (XPath):
**/eml:eml/dataset/publisher**
The organization producing the EML metadata (e.g., an LTER site or field
station) should be placed in the <**publisher**> element. Spell out
the organization's name (<**organizationName**>). Complete the
<**address**>, <**phone**>, <**electronicMailAddress**>, and
<**onlineURL**> elements for each publisher element. Some citation
displays may use this element, although typically, the repository
becomes the publisher in citations.
Example 8: publisher
```xml
<publisher>
<organizationName>Fictitious LTER site</organizationName>
</publisher>
```
## pubDate {-}
This element is found at this location (XPath):
**/eml:eml/dataset/pubDate**
The year of public release of data online should be listed as the
<**pubDate**> element. Because this element may be used in
constructing citations, the **pubDate** also should reflect the
'recentness' of a package, with **pubDate** updated along with
significant revision or data additions (e.g., corrected data, or
additions to an ongoing time series). There is an argument for
**pubDate** referring to original date of release, but this is probably
only useful for static data packages, or if the only metadata changes
are to enhance discovery.
## abstract {-}
This element is found at these locations (XPath):
**/eml:eml/dataset/abstract**
**/eml:eml/dataset/project/abstract**
For a dataset, the abstract element can appear at the resource level or
the project level. The <**abstract**> element will be used for
full-text searches, and it should be rich with descriptive text. In
particular, descriptions should include information that does not fit
into structured metadata, and focus on the "what", "when", and "where"
information, general taxonomic information, as well as whether the
dataset is ongoing or completed. Some general methods description is
appropriate, and broad classes of measured parameters should also be
included. For a large number of parameters, use categories instead of
listing all parameters (e.g. use the term "nutrients" instead of
nitrate, phosphate, calcium, etc.), in combination with the parameters
that seem most relevant for searches.
## keywordSet and keyword {-}
This element is found at these locations (XPath):
**/eml:eml/dataset/keywordSet**
**/eml:eml/dataset/project/keywordSet**
It is recommended that meaningful sets of keywords each be contained
within <**keywordSet**> tag. Use one <**keywordSet**> for a group of
terms identifying the contributing organization(s), e.g., the LTER or
OBFS site, LTREB or Macrosystems project , which is especially if data
are co-funded or funding is leveraged. Meaningful geographic place names
also are appropriate (e.g. state, city, county). If groups of keywords
are from a specific vocabulary, its name belongs the optional tag
<**keywordThesaurus**>.
_Context: Communities sometimes have specific requests for keywords to
assist in searches. E.g, the LTER requests that keywords should include
a LTER core research area(s), the network acronym (LTER, ILTER, etc.),
three-letter site acronym and site name. In addition to specific
keywords, relevant conceptual keywords should also be included, e.g.,
from the LTER Controlled Vocabulary._
Example 9: pubDate, abstract,keywordSet, keyword
```xml
<pubDate>2014</pubDate>
<abstract>
<para>Ground arthropods communities are monitored in different
habitats in a rapidly changing environment. The arthropods are
collected in traps four times a year in ten locations and determined
as far as possible to family, genus or species.</para>
</abstract>
<keywordSet>
<keyword keywordType="place">City</keyword>
<keyword keywordType="place">State</keyword>
<keyword keywordType="place">Region</keyword>
<keyword keywordType="place">County</keyword>
<keyword keywordType="theme">FLS</keyword>
<keyword keywordType="theme">Fictitious LTER Site</keyword>
<keyword keywordType="theme">LTER</keyword>
<keyword keywordType="theme">Arthropods</keyword>
<keyword keywordType="theme">Richness</keyword>
<keywordThesaurus>FLS site thesaurus</keywordThesaurus>
</keywordSet>
<keywordSet>
<keyword keywordType="theme">ecology</keyword>
<keyword keywordType="theme">biodiversity</keyword>
<keyword keywordType="theme">population dynamics</keyword>
<keyword keywordType="theme">terrestrial</keyword>
<keyword keywordType="theme">arthropods</keyword>
<keyword keywordType="theme">pitfall trap</keyword>
<keyword keywordType="theme">monitoring</keyword>
<keyword keywordType="theme">abundance</keyword>
<keywordThesaurus>LTER controlled vocabulary</keywordThesaurus>
</keywordSet>
<keywordSet>
<keyword keywordType="theme">populations</keyword>
<keywordThesaurus>LTER core research areas</keywordThesaurus>
</keywordSet>
```
## intellectualRights {-}
This element is found at this location (XPath):
**/eml:eml/dataset/intellectualRights**
<**intellectualRights**> are controlled at the source, however it is
recommended that data be released with as few restrictions as possible.
Each data package should contain a data access policy, plus a
description of any deviation from the general policy specific for this
particular package (e.g. restricted-access packages). The timeframe for
release should be included as well.
_Context: If no_ <**intellectualRights**> _element is included EDI
will insert text that releases data under "CC-0" (shown in example). The
LTER Network-wide default policy is "CC-BY". Please consult those
organizations for more information and more details._
Example 10: intellectualRights
```xml
<intellectualRights>
<section>
<title>Data Policy</title>
<para>This data package is released to the "public domain" under
Creative Commons CC0 1.0 "No Rights Reserved" (see:
https://creativecommons.org/publicdomain/zero/1.0/). It is considered
professional etiquette to provide attribution of the original work if
this data package is shared in whole or by individual components. A
generic citation is provided for this data package on the website
https://portal.edirepository.org (herein "website") in the summary
metadata page. Communication (and collaboration) with the creators of
this data package is recommended to prevent duplicate research or
publication. This data package (and its components) is made available
"as is" and with no warranty of accuracy or fitness for use. The
creators of this data package and the website shall not be liable for
any damages resulting from misinterpretation or misuse of the data
package or its components. Periodic updates of this data package may
be available from the website. Thank you.</para>
</section>
</intellectualRights>
```
## distribution {-}
This element is found at these locations (XPath):
**/eml:eml/dataset/distribution**
**/eml:eml/dataset/[entity]/physical/distribution**
The <**distribution**> element can appear at both the dataset and
entity levels.
### Dataset level
At the dataset level, the `<distribution>` element should be used for information only,
because it applies to the entire package, not only to one entity.
_Context: The EDI repository will ignore a `<distribution>` element
at the dataset level._
Example 11a: distribution at the dataset level
```xml
<distribution>
<online>
<onlineDescription>f1s-1 Data Web Page</onlineDescription>
<url function="information">
http://www.fsu.edu/lter/data/fls-1.htm
</url>
</online>
</distribution>
```
### Entity level
The entity-level `<distribution>` element contains information on how that
specific data entity (e.g., data table) can be accessed. The <**distribution**> element
has one of three children for describing the location of the resource:
<**online**>, <**offline**>, and <**inline**>.
**Offline Data**: Use the <**offline**> element to describe
restricted access data or data that is not available online. The minimum
that should be included is the <**mediumName**> tag, if using the
<**offline**> element.
**Inline Data**: The <**inline**> element contains data that
is stored directly within the EML document. Data included as text or
string will be parsed. If data are not to be parsed, encode them as
"CDATA sections," by surrounding them with "`<![CDATA[`" and "`]]>`"
tags.
**Online Data**: The <**online**> element has two sub
elements, <**url**>, and <**onlineDescription**> (optional).
<**url**> tags may have an optional attribute named **function**,
which may be set to either "download" or "information". If the
"function" attribute is omitted, then "download" is implied.
**\@function="download"**: accessing the URL directly returns the data
stream
**\@function="information"**: URL leads to a data catalog, intended-use
page, or other page that provides information about downloading the
object but does not directly return the data stream, then the
"function" attribute should be set to "information".
_Context: for am EML data package to be accepted into the EDI
repository, it must include at least one URL; at the entity level (e.g.,
a dataTable at /eml:eml/dataset/dataTable/physical/distribution/url).
The URL must include the function attribute with the value "download"
(or empty, i.e., defaults to "download")._
_Context: The EDI repository system has alternatives for uploading data
entities if you do not have a server which can deliver entities via a URL (http).
Contact EDI for more information on these options._
When used at the entity level, an alternative tag is available to
<**url**>, called <**connection**>. This element is discussed under
data entities, below.
As of EML 2.1, there is also an optional <**access**> element in a
<**distribution**> tree at the data entity level
**(/eml:eml/dataset/[entity]/physical/distribution/access**). This
element is intended specifically for controlling access to the data
entity itself. For more information on the <**access**> tree, see
above, under the general access discussion.
Example 11b: distribution at the data entity level
```xml
<dataTable>
<physical>
...
<distribution>
<online>
<onlineDescription>f1s-1 Data Web Page</onlineDescription>
<url function="download">
http://www.fsu.edu/lter/data/fls-1.csv
</url>
</online>
</distribution>
</physical>
</dataTable>
```
## coverage {-}
This element is found at these locations (XPath):
**/eml:eml/dataset/coverage**
**/eml:eml/dataset/methods/sampling/studyExtent/coverage**
**/eml:eml/dataset/methods/sampling/spatialSamplingUnits/coverage**
**/eml:eml/dataset/[entity]/coverage**
**/eml:eml/dataset/[entity]/methods/sampling/studyExtent/coverage**
**/eml:eml/dataset/[entity]/methods/sampling/spatialSamplingUnits/coverage**
**/eml:eml/dataset/[entity]/attributeList/attribute/coverage**
**/eml:eml/dataset/[entity]/attributeList/attribute/methods/sampling/studyExtent/coverage**
**/eml:eml/dataset/[entity]/attributeList/attribute/methods/sampling/spatialSamplingUnits/coverage**
**/eml:eml/dataset/project/studyAreaDescription/coverage**
The <**coverage**> element can appear at the dataset, methods, entity
and attribute levels, and contains three elements for describing the
coverage in terms of space, taxonomy, and time,
<**geographicCoverage**>, <**taxanomicCoverage**>, and
<**temporalCoverage**>. Populating these elements as recommended
enables advanced searches and understanding. Because they appear at many
XPaths, there are many options for how coverage elements can be used.
### geographicCoverage
**General Information**: The <**geographicCoverage**>
element describes locations of research sites and areas related to the
data, and is intended for general placement of points on a map. It is
recommended to use the element at different levels for different types
of information. The cardinality of the <**geographicCoverage**>
element is one-to-many. The miminum requirement under
<**geographicCoverage**> is two elements, a
<**geographicDescription**> and <**boundingCoordinates**> with a
bounding box containing N, S, E, W limits.
At the dataset level (**eml:eml/dataset/coverage**) one
<**geographicCoverage**> element should be included, whose
<**boundingCoordinates**> describe the extent of the data. As a
default, this could be the nominal boundaries of a sampling area. A more
accurate extent (recommended) would be the maximum extent of the data,
for each of east, west, north and south.
Additional <**geographicCoverage**> elements should be included if
there are significant distances between study sites and grouping them in
one bounding box would be misleading or confusing. For example, a
cross-site study should have bounding boxes for each site.
Example 12: geographicCoverage at the dataset level
```xml
<coverage>
<geographicCoverage>
<geographicDescription>
Ficity, FI metropolitan area, USA
</geographicDescription>
<boundingCoordinates>
<westBoundingCoordinate>-112.373614</westBoundingCoordinate>
<eastBoundingCoordinate>-111.612936</eastBoundingCoordinate>
<northBoundingCoordinate>33.708829</northBoundingCoordinate>
<southBoundingCoordinate>33.298975</southBoundingCoordinate>
<boundingAltitudes>
<altitudeMinimum>300</altitudeMinimum>
<altitudeMaximum>600</altitudeMaximum>
<altitudeUnits>meter</altitudeUnits>
</boundingAltitudes>
</boundingCoordinates>
</geographicCoverage>
</coverage>
```
If sampling took place in discrete point location, those sites should
also appear with or without a bounding box. Individual sampling sites
may also be be entered under <**spatialSamplingUnits**>, each site in
a separate coverage element (see below).
Example 13: geographicCoverage under spatialSamplingUnits
```xml
<spatialSamplingUnits>
<coverage>
<geographicDescription>sitenumber 1</geographicDescription>
<boundingCoordinates>
<westBoundingCoordinate>-112.2</westBoundingCoordinate>
<eastBoundingCoordinate>-112.2</eastBoundingCoordinate>
<northBoundingCoordinate>33.5</northBoundingCoordinate>
<southBoundingCoordinate>33.5</southBoundingCoordinate>
</boundingCoordinates>
</coverage>
<coverage>
<geographicDescription>sitenumber 2</geographicDescription>
<boundingCoordinates>
<westBoundingCoordinate>-111.7</westBoundingCoordinate>
<eastBoundingCoordinate>-111.7</eastBoundingCoordinate>
<northBoundingCoordinate>33.6</northBoundingCoordinate>
<southBoundingCoordinate>33.6</southBoundingCoordinate>
</boundingCoordinates>
</coverage>
<coverage>
<geographicDescription>sitenumber 3</geographicDescription>
<boundingCoordinates>
<westBoundingCoordinate>-112.1</westBoundingCoordinate>
<eastBoundingCoordinate>-112.1</eastBoundingCoordinate>
<northBoundingCoordinate>33.7</northBoundingCoordinate>
<southBoundingCoordinate>33.7</southBoundingCoordinate>
</boundingCoordinates>
</coverage>
</spatialSamplingUnits>
```