Commit cd46d7d

Updated datasets 2026-03-20 UTC
1 parent dea7403 commit cd46d7d

6 files changed

Lines changed: 903 additions & 895 deletions

aws_open_datasets.json

Lines changed: 1 addition & 1 deletion
@@ -33825,7 +33825,7 @@
 {
 "Name": "OpenFold3 Training Data",
 "Description": "A repository of MSAs and 3D protein structural coordinates used to train OpenFold3",
-"ARN": "arn:aws:s3:::openfold3",
+"ARN": "arn:aws:s3:::openfold3-data",
 "Region": "us-east-2",
 "Type": "S3 Bucket",
 "Documentation": "https://github.com/aqlaboratory/openfold3-training-data-RODA/tree/main",

aws_open_datasets.tsv

Lines changed: 1 addition & 1 deletion
@@ -1225,7 +1225,7 @@ OpenCRAVAT OpenCRAVAT Store EU arn:aws:s3:::opencravat-store-eu-west-2 eu-west-2
 OpenCRAVAT OpenCRAVAT Store US arn:aws:s3:::opencravat-store-aws us-east-1 S3 Bucket https://open-cravat.readthedocs.io support@opencravat.org KarchinLab, Potomac IT Group Data is mirrored daily. Update frequencies of individual annotators depend on th "License varies per-annotator. Commercial users must check the ""commercial_warnin" aws-pds, genetic, genomic, life sciences, variant annotation, sqlite, tertiary analysis
 OpenCell on AWS Live-cell confocal fluorescence microscopy images of the OpenCell library of flu arn:aws:s3:::czb-opencell us-west-2 S3 Bucket https://opencell.czbiohub.org/download opencell@czbiohub.org [Chan Zuckerberg Biohub](https://www.czbiohub.org/) This is the final version of the dataset. CC BY-SA 4.0 aws-pds, biology, cell biology, life sciences, imaging, cell imaging, fluorescence imaging, microscopy, computer vision, machine learning
 OpenEEW OpenEEW arn:aws:s3:::grillo-openeew us-east-1 S3 Bucket https://github.com/openeew/openeew hello@openeew.com [Grillo](https://grillo.io/) Approximately every 5 minutes https://github.com/openeew/openeew#license disaster response, earth observation, earthquakes, aws-pds ['[Browse Bucket](https://grillo-openeew.s3.amazonaws.com/index.html)']
-OpenFold3 Training Data A repository of MSAs and 3D protein structural coordinates used to train OpenFol arn:aws:s3:::openfold3 us-east-2 S3 Bucket https://github.com/aqlaboratory/openfold3-training-data-RODA/tree/main https://github.com/aqlaboratory/openfold3-training-data-RODA/issues OpenFold Never [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/) openfold, msa, protein, protein template, protein folding, open source software, life sciences, aws-pds
+OpenFold3 Training Data A repository of MSAs and 3D protein structural coordinates used to train OpenFol arn:aws:s3:::openfold3-data us-east-2 S3 Bucket https://github.com/aqlaboratory/openfold3-training-data-RODA/tree/main https://github.com/aqlaboratory/openfold3-training-data-RODA/issues OpenFold Never [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/) openfold, msa, protein, protein template, protein folding, open source software, life sciences, aws-pds
 OpenNeuro MRI, MEG, EEG, iEEG, and ECoG datasets from OpenNeuro arn:aws:s3:::openneuro.org us-east-1 S3 Bucket http://openneuro.org Support form at https://openneuro.org [Stanford University Center for Reproducible Neuroscience](https://reproducibili New datasets deposited every 4-6 days CC0 aws-pds, biology, imaging, life sciences, neurobiology, neuroimaging
 OpenProteinSet A repository of MSAs and template hits arn:aws:s3:::openfold us-east-1 S3 Bucket https://docs.google.com/document/d/1R90-VJSLQEbot7tgXF3zb068Y1ZJAmsckQ_t2sJTv2c/ https://github.com/aqlaboratory/openfold/issues OpenFold Never [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/) openfold, msa, protein, protein template, protein folding, alphafold, open source software, life sciences, aws-pds
 OpenRoboCare Multi-Modal Expert Demonstration Dataset for Robot-Assisted Caregiving Full Dataset arn:aws:s3:::open-robo-care us-west-2 S3 Bucket https://emprise.cs.cornell.edu/robo-care/docs https://emprise.cs.cornell.edu/robo-care/ [EmPRISE Lab at Cornell University](https://emprise.cs.cornell.edu/) Static dataset - no regular updates planned BSD-3-Clause license - Academic and non-commercial use permitted. See documentat computer vision, robotics, machine learning, health, aws-pds, life sciences
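
Both hunks change only the bucket name inside the OpenFold3 ARN. As a minimal sketch of what that means for consumers of these files, the snippet below derives the bucket name and an s3:// URI from the old and new ARN values. The helper name bucket_from_s3_arn is hypothetical and not part of this repository; it assumes the standard colon-separated ARN layout.

```python
def bucket_from_s3_arn(arn: str) -> str:
    """Return the bucket name from an S3 ARN such as arn:aws:s3:::name.

    Hypothetical helper for illustration; not part of this repository.
    """
    # An ARN has six colon-separated fields:
    # arn:partition:service:region:account-id:resource
    parts = arn.split(":", 5)
    if len(parts) != 6 or parts[0] != "arn" or parts[2] != "s3":
        raise ValueError(f"not an S3 ARN: {arn!r}")
    # For a bucket ARN the resource field is the bucket name. An object ARN
    # appends "/key", so keep only the part before the first slash.
    return parts[5].split("/", 1)[0]


# ARN values taken from the removed and added lines of the diff.
old_arn = "arn:aws:s3:::openfold3"
new_arn = "arn:aws:s3:::openfold3-data"

old_uri = f"s3://{bucket_from_s3_arn(old_arn)}/"
new_uri = f"s3://{bucket_from_s3_arn(new_arn)}/"
print(old_uri, "->", new_uri)
```

Anything that hard-codes the old bucket name would presumably need the same one-token update; whether the old bucket remains readable is not stated in this commit.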
