Detailed instructions¶
Warning
The catalogue is in testing phase. Any metadata uploaded in the catalogue is not yet backed up. This guide is still under development.
Phase 0¶
-
Understand the basic structure of the metadata model.
-
Decide on the what experiments to group into (the investigation and) the study.
-
Choose the study type (sequencing, phenotyping or combined). Inform DataXR well up front when your study does not fit these types.
-
Make sure that you have registered your study in this form. After the admin has checked the registration, you will receive a study id you need when entering the metadata, as well as storage for your experimental data.
-
Create an account for the catalogue and log in. [Write about normal or KeyCloak?]
Request to be added to a project. This is done by a project admin.
-
Gather the files with the metadata that you already collected. This includes e-lab journal entries, files that outline experimental conditions per plant, files that link output files to the experiment, etc.
Phase 1¶
-
Log in to the FairdomSEEK webpage.
-
Find or create an investigation.
- To find an investigation, click on “🔎Browse” (click the menu first when using a narrow window) in the top left corner and select “Investigations”.
- If your investigation is not yet registered, click on “➕Create” and select “Investigations”. Add a Title and select a Project and click on “Create”.
-
From within the investigation, create a new study by clicking “➕Design Study” at the top.
- Create a tile using the naming convention.
-
Choose the extended metadata type that is relevant for your study.
Of the extended metadata, only fill the mandatory fields for now (to be able to save). The other fields can still be entered later. Quickly skim the fields so you are aware of what is collected on study level.
-
For now, skip the fields “Study position”, “Sharing”, “Creators”, “Publications”, and “Discussion Channels”. The fields “Sharing”, “Creators”, “Publications” can be reviewed and modified later.
-
At “Define Sample type for Source”, select an “Existing template”. Choose the default template “CropXR source” and click “Apply”. Now review the predefined parameters that are collected for the study source. These are the fields that will be used to describe your experimental setup and conditions. If there are any missing, you can add them at the bottom by selecting “➕Add new attribute”. Make sure that the column “ISA Tag” is set to “source_characteristic”. Additional fields can also be added later.
In MIAPPE there are many suggestions for environmental parameters and experimental factors to collect. Please reference these lists, to get a predictable field name.
Please do not remove any fields; this will make your study harder to find. All fields not relevant to your study can be left empty.

-
Skip “SOPs” for now, these will be added later.
-
At “Define Sample type for Sample” at select an “Existing template” you need to “Choose a template” in the drop down selection. Now select, depending on the units of measurement in your study: “CropXR sample” when there are samples only, “CropXR sample or observation unit” if there are both samples in your study as well as measurements on a different level, and “CropXR observation unit” when there are no samples. Choose the latest version of the template you need, and click “Apply”. Here you can also check the fields and add additional fields when needed, as describe for the source, while it is likely not needed.
-
Click on “🟦Create”
-
Make a plan for how to define the Assay Streams and Assays. (read here)
-
Click on “➕Design Assay Stream” at the top of the study, enter a title (following the naming convention).
If needed, select the “Extended metadata” for the assay stream. Skip all other fields for now and click on “🟦Create”.
- From the created Assay Stream, create an assay by clicking “➕Design Assay”
- Enter a title (following the naming convention)
- For now skip the fields “Sharing”, “Creators”, “SOPs”, “Publications”, “Documents” and “Channel discussions”.
- At “Define Sample type for Assay” start at the “Existing Templates”. Depending on the type of assay you are making (based on the plan made in step 4), change the “ISA Level” drop down to “assay - data file” or leave it as is. Choose the latest version of the relevant template in the drop down menu and “Apply”.
- The sample type can be expanded with additional columns that should be included as metadata. This is relevant for assays where most parameters are kept constant, but some are varied between measurements/samples. If the fields is related to the assay performed, the column “ISA Tag” should be set to “parameter_value”, if it is about the output data file set it to “data_file_characteristic”.
- If needed (not included already in the assay), create the next assay of the type data file for the raw data.
- If needed, create the next assay of the type data file for the derived data.
Now you have the outline of your metadata structure. The actual metadata can be uploaded.
Phase 2¶
Not all steps need to happen in this exact order, but some steps are dependent on each other: study sources need to be created before study samples and study samples before assay row entries. Data files need to be registered before they can be referenced.
-
Register your data files
- At the top menu under “➕Create” choose “Data file”
- Choose how you want to register your data. Find the URL at the data location (Research Drive).
- Under the tab “Remote URL” past the URL of the data location and “🟦Register”. Do not mind a warning about the URL in a yellow box. If there is an error and the URL cannot be registered please check if the URL is correct.
- Fill in a “Title”, a “Description”, select a “Project”, click “🟦Next, and “🟦Next”
- Select a license. The license can be adapted later if needed. At no license the consortium agreement applies to all people users that this data is shared with.
- Adapt the sharing of the data, either on group level or per group. At a later stage, the permissions set here automatically apply to where the data is stored. For now, that is still handled separately in the research drive.
- Skip “Associated Assays” and “Other associated items”by clicking “🟦Next, and “🟦Create”
-
Add SOPs (Standard operating procedure) (can also be done later). Each step can reference a protocol that was used: creation of samples and the grouping of observation units at the study level, the assaying protocols at assay level and the data transformation steps.
- At the top menu under “➕Create” choose “SOP”
- Click “Browse” to upload a local file.
- Add a “Title”. This needs to be specific enough to find the SOP back between other SOPs from different studies.
- Add a “Description”, select a “Project” and a “License”. Skip “Discussion Channels”, adapt “Sharing”, skip “Creators”, “Tags” and “Attributions”
- If the SOP is related to an assay, it can be linked under “Experimental assays and Modelling analyses”. This can also be done at the assay.
- If a data processing step is described by a registered workflow/processing pipeline, it can be linked under “Workflows”.
- Click “🟦Register”
-
Edit the study by going to “⚙️Actions” in the top corner and then “📝Edit ISA Study”
- Fill in the description
- Fill in extended metadata. Focus on the fields most relevant to understand the study. Skip the fields that do not apply. The fields can always be revised later.
- Under “SOPs” select the SOP(s) that describe the sampling and the experimental design map.
- Click “🟦Update” to apply the changes.
-
Define study source samples
- Choose how to group/define the sources.
- Go to the “Sources table”, by clicking the tab “Study design” (or from the “Single page” view find the “Sources table” in the left menu). Here you find a table with the columns that you have defined.
- Download the template by clicking the button “Batch download to Excel”.
- In the excel, under the Samples tab fill in the data in the fields.
- Ignore the first two columns.
- The Source Name is the name that will be displayed.
- Start with the most relevant fields to understand the study and the sources used. The data can be improved on at a later point. Fill in at least the species and the experimental group.
-
Save the file. Upload it by under “Upload excel spreadsheet” select “Browse” and click “🟦Upload”. Now there might be an error message. Please read it carefully and adjust the data accordingly.
Be aware, if you upload the same excel multiple times, a new sample will be created with the same name. To check how to update existing samples, check the phase 3 instructions.
-
Define study samples
- Choose what type are needed.
- Go to “Samples table” under “Study design”(or from the “Single page” view find the “Samples table” in the left menu).
- Download the template by clicking the button “Batch download to Excel”.
- In the excel, under the Samples tab fill in the data in the fields.
- Ignore the first two columns.
- Use the Input column to link to a source defined in the previous step.
- The subject_id is the name that will be displayed.
- There is a mandatory column called protocol. The text should refer to a registered SOP.
- Start with the most relevant fields. The data can be improved on at a later point.
- Save the file. Upload it by under “Upload excel spreadsheet” select “Browse” and click “🟦Upload”. Now there might be an error message. Please read it carefully and adjust the data accordingly.
-
Fill in the assay stream extended metadata by going to “⚙️Actions” in the top corner and then “📝Edit Assay Stream”. Focus on the fields that are most important and “🟦Update” to save.
-
For each assay defined in phase 1, enter the row data, the same way as the study source and sample: download the template, fill in the data, save and upload the template.
- For the first assay of an assay stream the input should be a study sample (so this is a sample of observation unit). For additional assays the input is an output of the previous assay.
- To link a registered data file as file location, use the required format.
- For the file name, use the relative path of the exact file inside the registered file location.
Phase 3¶
- Update the sharing permissions that each of the created elements have. Think about who should see your study. If you are collaborating with others you can give them edit permission. You can edit them by navigating to the Study/Assay/DataFile/SOP, clicking “⚙️Actions” in the top corner and then “🔧Manage ..” After making the changes make sure to “🟦Update”
Note that in the future the permission you set in SEEK on the data file will be automatically set where the data is stored, but for now the permission is still set separately within the Research Drive.
-
Update the study and assay extended metadata to make the metadata more complete (under “ “⚙️Actions” in the top corner and then “📝Edit ..”)
-
Updating existing samples [todo: expand]
- make sure it’s selected before downloaded, so there is no new sample with the same name
- You can upload multiple in multiple steps, but be aware that if you re-upload samples with the same name, a new sample will be created. Only upload new samples.
-
Adding additional columns [todo: expand] (under “ “⚙️Actions” in the top corner and then “📝Edit ..”)