Commit 3394b17

Updated datasets 2026-04-23 UTC
1 parent 35052ff commit 3394b17

6 files changed

Lines changed: 1043 additions & 984 deletions

aws_open_datasets.json

Lines changed: 1 addition & 1 deletion
@@ -34546,7 +34546,7 @@
 "Name": "OpenFold3 Training Data",
 "Description": "A repository of MSAs and 3D protein structural coordinates used to train OpenFold3",
 "ARN": "arn:aws:s3:::openfold3-data",
-"Region": "us-east-2",
+"Region": "us-west-2",
 "Type": "S3 Bucket",
 "Documentation": "https://portal.openfold.omsf.io/datasets",
 "Contact": "https://github.com/aqlaboratory/openfold-3/issues",
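
The hunk above changes a single field of one record. As a minimal sketch of how a consumer of this file might confirm that, the snippet below compares the old and new versions of the record and reports only the keys whose values differ. The two dictionaries are hand-copied from the fields visible in the hunk (the full record has more fields that are not shown here), and the helper names `changed_fields` and `bucket_from_arn` are hypothetical, not part of the repository.

```python
# Fields copied from the diff hunk; the real record contains more keys
# that are not visible in this excerpt (assumption: a flat JSON object).
OLD_RECORD = {
    "Name": "OpenFold3 Training Data",
    "ARN": "arn:aws:s3:::openfold3-data",
    "Region": "us-east-2",
    "Type": "S3 Bucket",
}
NEW_RECORD = {
    "Name": "OpenFold3 Training Data",
    "ARN": "arn:aws:s3:::openfold3-data",
    "Region": "us-west-2",
    "Type": "S3 Bucket",
}


def changed_fields(old: dict, new: dict) -> dict:
    """Return {key: (old_value, new_value)} for every key whose value differs."""
    keys = sorted(set(old) | set(new))
    return {k: (old.get(k), new.get(k)) for k in keys if old.get(k) != new.get(k)}


def bucket_from_arn(arn: str) -> str:
    """Extract the bucket name from an S3 bucket ARN of the form arn:aws:s3:::<bucket>."""
    prefix = "arn:aws:s3:::"
    if not arn.startswith(prefix):
        raise ValueError(f"not an S3 bucket ARN: {arn!r}")
    return arn[len(prefix):]


if __name__ == "__main__":
    print(bucket_from_arn(NEW_RECORD["ARN"]), changed_fields(OLD_RECORD, NEW_RECORD))
```

Anything that caches the region per bucket would need to pick up the new value after this commit; the sketch only illustrates the comparison and does not contact AWS.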

aws_open_datasets.tsv

Lines changed: 1 addition & 1 deletion
@@ -1254,7 +1254,7 @@ OpenCRAVAT OpenCRAVAT Store EU arn:aws:s3:::opencravat-store-eu-west-2 eu-west-2
 OpenCRAVAT OpenCRAVAT Store US arn:aws:s3:::opencravat-store-aws us-east-1 S3 Bucket https://open-cravat.readthedocs.io support@opencravat.org KarchinLab, Potomac IT Group Data is mirrored daily. Update frequencies of individual annotators depend on th "License varies per-annotator. Commercial users must check the ""commercial_warnin" aws-pds, genetic, genomic, life sciences, variant annotation, sqlite, tertiary analysis
 OpenCell on AWS Live-cell confocal fluorescence microscopy images of the OpenCell library of flu arn:aws:s3:::czb-opencell us-west-2 S3 Bucket https://opencell.czbiohub.org/download opencell@czbiohub.org [Chan Zuckerberg Biohub](https://www.czbiohub.org/) This is the final version of the dataset. CC BY-SA 4.0 aws-pds, biology, cell biology, life sciences, imaging, cell imaging, fluorescence imaging, microscopy, computer vision, machine learning
 OpenEEW OpenEEW arn:aws:s3:::grillo-openeew us-east-1 S3 Bucket https://github.com/openeew/openeew hello@openeew.com [Grillo](https://grillo.io/) Approximately every 5 minutes https://github.com/openeew/openeew#license disaster response, earth observation, earthquakes, aws-pds ['[Browse Bucket](https://grillo-openeew.s3.amazonaws.com/index.html)']
-OpenFold3 Training Data A repository of MSAs and 3D protein structural coordinates used to train OpenFol arn:aws:s3:::openfold3-data us-east-2 S3 Bucket https://portal.openfold.omsf.io/datasets https://github.com/aqlaboratory/openfold-3/issues OpenFold Consortium Never [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/) openfold, msa, protein, protein template, protein folding, open source software, life sciences, aws-pds
+OpenFold3 Training Data A repository of MSAs and 3D protein structural coordinates used to train OpenFol arn:aws:s3:::openfold3-data us-west-2 S3 Bucket https://portal.openfold.omsf.io/datasets https://github.com/aqlaboratory/openfold-3/issues OpenFold Consortium Never [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/) openfold, msa, protein, protein template, protein folding, open source software, life sciences, aws-pds
 OpenNeuro MRI, MEG, EEG, iEEG, and ECoG datasets from OpenNeuro arn:aws:s3:::openneuro.org us-east-1 S3 Bucket http://openneuro.org Support form at https://openneuro.org [Stanford University Center for Reproducible Neuroscience](https://reproducibili New datasets deposited every 4-6 days CC0 aws-pds, biology, imaging, life sciences, neurobiology, neuroimaging, neuroscience
 OpenProteinSet A repository of MSAs and template hits arn:aws:s3:::openfold us-east-1 S3 Bucket https://docs.google.com/document/d/1R90-VJSLQEbot7tgXF3zb068Y1ZJAmsckQ_t2sJTv2c/ https://github.com/aqlaboratory/openfold/issues OpenFold Never [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/) openfold, msa, protein, protein template, protein folding, alphafold, open source software, life sciences, aws-pds
 OpenRoboCare Multi-Modal Expert Demonstration Dataset for Robot-Assisted Caregiving Full Dataset arn:aws:s3:::open-robo-care us-west-2 S3 Bucket https://emprise.cs.cornell.edu/robo-care/docs https://emprise.cs.cornell.edu/robo-care/ [EmPRISE Lab at Cornell University](https://emprise.cs.cornell.edu/) Static dataset - no regular updates planned BSD-3-Clause license - Academic and non-commercial use permitted. See documentat computer vision, robotics, machine learning, health, aws-pds, life sciences
