Skip to content

Commit eaea4df

Browse files
committed
Updated datasets 2025-07-03 UTC
1 parent b35fbf0 commit eaea4df

8 files changed

Lines changed: 3125 additions & 3171 deletions

aws_open_datasets.json

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -3092,7 +3092,7 @@
30923092
"Contact": "brock.wester@jhuapl.edu",
30933093
"ManagedBy": "[Johns Hopkins University Applied Physics Laboratory](https://https://jhuapl.edu)",
30943094
"UpdateFrequency": "New datasets are added as soon as it is available. Minor updates on existing datasets occur sporadically.",
3095-
"License": "Creative Commons 4.0 International (CC BY 4.0)",
3095+
"License": "Creative Commons 4.0 International (CC BY 4.0); Creative Commons CC0 1.0 Universal (CC0-1.0)",
30963096
"Tags": [
30973097
"aws-pds",
30983098
"life sciences",
@@ -31421,7 +31421,7 @@
3142131421
"Type": "S3 Bucket",
3142231422
"Documentation": "https://docs.sparc.science",
3142331423
"Contact": "joostw@seas.upenn.edu, support@sparc.science",
31424-
"ManagedBy": "[The SPARC Data Resource Center](https://sparc.science/about)",
31424+
"ManagedBy": "[The SPARC Data and Resource Center](https://sparc.science/about)",
3142531425
"UpdateFrequency": "Continually adding new datasets and releasing versions of datasets",
3142631426
"License": "CC-BY-4.0",
3142731427
"Tags": [

aws_open_datasets.tsv

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -112,7 +112,7 @@ Blue Brain Open Data Data files representing neurological tissue structures arn:
112112
BodyM Dataset This S3 bucket has height, weight, gender, measurements and two silhouette image arn:aws:s3:::amazon-bodym us-west-2 S3 Bucket https://adversarialbodysim.github.io/ Post any questions to [re:Post](https://repost.aws/tags/questions/TApd0Wl5P8S9O6 Amazon None Creative Commons Attribution-Non Commercial 4.0 International Public License - h amazon.science, computer vision, deep learning
113113
Boltz-1 Training Data S3 Bucket containing the Boltz-1 data arn:aws:s3:::boltz1 us-east-2 S3 Bucket https://github.com/jwohlwend/boltz/blob/main/docs/training.md jwohlwend@csail.mit.edu MIT CSAIL - Regina Barzilay Group None MIT License deep learning, protein folding, molecular docking, open source software, life sciences
114114
Boreas Autonomous Driving Dataset Boreas dataset arn:aws:s3:::boreas us-west-2 S3 Bucket https://github.com/utiasASRL/pyboreas/blob/master/DATA_REFERENCE.md boreas@robotics.utias.utoronto.ca [ASRL](http://asrl.utias.utoronto.ca) New driving sequences will be added as they are collected. [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/legalcode) autonomous vehicles, robotics, computer vision, lidar, aws-pds
115-
BossDB Open Neuroimagery Datasets Large 3D volumes of neuroimaging data and image processing products such as segm arn:aws:s3:::bossdb-open-data us-east-1 S3 Bucket https://bossdb.org/ brock.wester@jhuapl.edu [Johns Hopkins University Applied Physics Laboratory](https://https://jhuapl.edu New datasets are added as soon as it is available. Minor updates on existing dat Creative Commons 4.0 International (CC BY 4.0) aws-pds, life sciences, imaging, neuroscience, neuroimaging, electron microscopy, x-ray tomography, x-ray microtomography, x-ray, magnetic resonance imaging, light-sheet microscopy, calcium imaging, volumetric imaging
115+
BossDB Open Neuroimagery Datasets Large 3D volumes of neuroimaging data and image processing products such as segm arn:aws:s3:::bossdb-open-data us-east-1 S3 Bucket https://bossdb.org/ brock.wester@jhuapl.edu [Johns Hopkins University Applied Physics Laboratory](https://https://jhuapl.edu New datasets are added as soon as it is available. Minor updates on existing dat Creative Commons 4.0 International (CC BY 4.0); Creative Commons CC0 1.0 Univers aws-pds, life sciences, imaging, neuroscience, neuroimaging, electron microscopy, x-ray tomography, x-ray microtomography, x-ray, magnetic resonance imaging, light-sheet microscopy, calcium imaging, volumetric imaging
116116
Brain Encoding Response Generator (BERG) Pre-trained neural encoding models arn:aws:s3:::brain-encoding-response-generator us-west-2 S3 Bucket https://brain-encoding-response-generator.readthedocs.io/en/latest/index.html brain.berg.info@gmail.com [Neural Dynamics of Visual Cognition](https://www.ewi-psy.fu-berlin.de/en/psycho Updates will be released on a per-model or dataset basis as new models are train CC BY-NC 4.0. For Terms & Conditions, see https://brain-encoding-response-genera neuroscience, machine learning, brain models, deep learning, neuroimaging, computer vision, life sciences ['[Browse Bucket](https://brain-encoding-response-generator.s3.us-west-2.amazonaws.com/index.html)']
117117
Brain/MINDS Marmoset Connectivity Resource on AWS NIfTI data for marmoset brains and metadata arn:aws:s3:::brainminds-marmoset-connectivity ap-northeast-1 S3 Bucket https://bmcr.brainminds.jp/ https://dataportal.brainminds.jp/contact-us [RIKEN Center for Brain Science](https://cbs.riken.jp) As new data become available. Creative Commons Attribution 4.0 International brain images, imaging, microscopy, neurobiology, neuroimaging, neuroscience, nifti, non-human primate, life sciences
118118
Broad Genome References This dataset includes two human genome references assembled by the Genome Refere arn:aws:s3:::broad-references us-east-1 S3 Bucket https://s3.amazonaws.com/broad-references/broad-references-readme.html hensonc@broadinstitute.org Broad Institute Monthly CC0 1.0 Universal (CC0 1.0) Public Domain Dedication aws-pds, biology, bioinformatics, cancer, genetic, genomic, life sciences, reference index, Homo sapiens
@@ -1149,7 +1149,7 @@ SILAM Air Quality Notifications for new netcdf surface data arn:aws:sns:eu-west-
11491149
SILAM Air Quality Notifications for new zarr surface data arn:aws:sns:eu-west-1:916174725480:new-fmi-opendata-silam-surface-zarr eu-west-1 SNS Topic http://en.ilmatieteenlaitos.fi/open-data-on-aws-s3 avoin-data@fmi.fi [Finnish Meteorological Institute](https://www.ilmatieteenlaitos.fi/) 1 time a day Creative Commons Attribution 4.0 International (CC BY 4.0) aws-pds, earth observation, climate, weather, air quality, meteorological
11501150
SILO climate data on AWS SILO open data arn:aws:s3:::silo-open-data ap-southeast-2 S3 Bucket https://www.longpaddock.qld.gov.au/silo/gridded-data https://www.longpaddock.qld.gov.au/silo/contact-us Queensland Government Daily SILO datasets are constructed by the [Queensland Government](http://www.qld.gov. aws-pds, agriculture, climate, earth observation, environmental, meteorological, model, sustainability, water, weather
11511151
SMN Hi-Res Weather Forecast over Argentina WRF SMN data arn:aws:s3:::smn-ar-wrf us-west-2 S3 Bucket General information, tutorials and examples:[https://odp-aws-smn.github.io/docum For any questions regarding the data set or any general questions, you can conta [SMN](http://www.smn.gov.ar/) New data is added as soon as it's available. Two forecast cycles a day initializ [Creative Commons Attribution 2.5 Argentina License](https://creativecommons.org aws-pds, earth observation, natural resource, weather, meteorological ['[Browse Bucket](https://smn-ar-wrf.s3.amazonaws.com/index.html)']
1152-
SPARC: Datasets bridging the body and the brain SPARC Public Datasets S3 Bucket arn:aws:s3:::sparc-prod-aod-discover-publish50-use1 us-east-1 S3 Bucket https://docs.sparc.science joostw@seas.upenn.edu, support@sparc.science [The SPARC Data Resource Center](https://sparc.science/about) Continually adding new datasets and releasing versions of datasets CC-BY-4.0 bioinformatics, life sciences, microscopy, neurophysiology, neuroscience, electrophysiology ['[SPARC Datasets](https://sparc.science/data?type=dataset)'] False
1152+
SPARC: Datasets bridging the body and the brain SPARC Public Datasets S3 Bucket arn:aws:s3:::sparc-prod-aod-discover-publish50-use1 us-east-1 S3 Bucket https://docs.sparc.science joostw@seas.upenn.edu, support@sparc.science [The SPARC Data and Resource Center](https://sparc.science/about) Continually adding new datasets and releasing versions of datasets CC-BY-4.0 bioinformatics, life sciences, microscopy, neurophysiology, neuroscience, electrophysiology ['[SPARC Datasets](https://sparc.science/data?type=dataset)'] False
11531153
SPARTAN Data All data products (PM25, aerosol chemical components, scattering) provided by S arn:aws:s3:::spartan-cloud us-west-2 S3 Bucket https://www.spartan-network.org/data SPARTAN.PM25@gmail.com The [Atmospheric Composition Analysis Group](https://sites.wustl.edu/acag/) New measurement or estimation products will be added when available, usually mul SPARTAN data is licensed under [CC BY 4.0](https://creativecommons.org/licenses/ aws-pds, environmental, air quality
11541154
SPaRCNet data:Seizures, Rhythmic and Periodic Patterns in ICU Electroencephalography ICU EEG Dataset arn:aws:s3:us-east-1:184438910517:accesspoint/bdsp-sparcnet-access-point us-east-1 S3 Bucket More documentation can be found [here](https://doi.org/10.60508/cw6j-s785) admin@bdsp.io [Brain Data Science Platform](https://bdsp.io/) New data is added as soon as it is available. "BDSP Restricted Health Data License 1.0.0 ""[BDSP Licence](https://bdsp.io/conten" aws-pds, neurophysiology, medicine, machine learning, neuroscience, deep learning, life sciences, bioinformatics https://doi.org/10.60508/cw6j-s785
11551155
SSL4EO S12 Landsat Multi Product Dataset Satellite imagery and context from Sentinel-12 and Landsat 4-5, 7, 8-9 arn:aws:s3:::ssl4eo-s12-landsat-combined us-west-2 S3 Bucket https://github.com/sunny1401/ssl4eo_multi_satellite_products https://github.com/sunny1401/ssl4eo_multi_satellite_products Sankranti Joshi Not updated https://creativecommons.org/licenses/by-nc-sa/4.0/ satellite imagery

aws_stac_catalogs.json

Lines changed: 9 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -503,6 +503,15 @@
503503
"Type": "S3 Bucket",
504504
"RequesterPays": null
505505
},
506+
{
507+
"Name": "Sentinel-1 Monthly Mosaic",
508+
"Endpoint": "https://explorer.digitalearth.africa/products/s1_monthly_mosaic",
509+
"Description": "S1 Monthly Mosaic tiles and metadata",
510+
"ARN": "arn:aws:s3:::deafrica-sentinel-1",
511+
"Region": "af-south-1",
512+
"Type": "S3 Bucket",
513+
"RequesterPays": false
514+
},
506515
{
507516
"Name": "Sentinel-2 - Level 1C scenes and metadata",
508517
"Endpoint": "https://earth-search.aws.element84.com/v1/collections/sentinel-2-l1c",

aws_stac_catalogs.tsv

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -55,6 +55,7 @@ Reference Elevation Model of Antarctica (REMA) - REMA DEM Mosaics https://polarg
5555
Reference Elevation Model of Antarctica (REMA) - REMA DEM Strips https://polargeospatialcenter.github.io/stac-browser/#/external/pgc-opendata-dems.s3.us-west-2.amazonaws.com/rema/strips.json REMA DEM Strips arn:aws:s3:::pgc-opendata-dems/rema/strips/ us-west-2 S3 Bucket
5656
Satellogic EarthView dataset https://satellogic-earthview.s3.us-west-2.amazonaws.com/stac/catalog.json Satellogic data includes TOA RGBN COG, VISUAL RGB COG files data and metadata arn:aws:s3:::satellogic-earthview us-west-2 S3 Bucket False
5757
Sentinel Near Real-time Canada Mirror https://www.eodms-sgdot.nrcan-rncan.gc.ca/stac/ Sentinel data over Canada | Données sentinelles au Canada. arn:aws:s3:::sentinel-products-ca-mirror ca-central-1 S3 Bucket
58+
Sentinel-1 Monthly Mosaic https://explorer.digitalearth.africa/products/s1_monthly_mosaic S1 Monthly Mosaic tiles and metadata arn:aws:s3:::deafrica-sentinel-1 af-south-1 S3 Bucket False
5859
Sentinel-2 - Level 1C scenes and metadata https://earth-search.aws.element84.com/v1/collections/sentinel-2-l1c Level 1C scenes and metadata, in [Requester Pays](https://docs.aws.amazon.com/AmazonS3/latest/dev/RequesterPaysBuckets.html) S3 bucket arn:aws:s3:::sentinel-s2-l1c eu-central-1 S3 Bucket True
5960
Sentinel-2 - Level 2A scenes and metadata https://sentinel-s2-l2a-stac.s3.amazonaws.com/ Level 2A scenes and metadata, in [Requester Pays](https://docs.aws.amazon.com/AmazonS3/latest/dev/RequesterPaysBuckets.html) S3 bucket arn:aws:s3:::sentinel-s2-l2a eu-central-1 S3 Bucket True
6061
Sentinel-2 Cloud-Optimized GeoTIFFs - Collection 1 Level 2A scenes and metadata https://earth-search.aws.element84.com/v1/collections/sentinel-2-c1-l2a Collection 1 Level 2A scenes and metadata arn:aws:s3:::e84-earth-search-sentinel-data us-west-2 S3 Bucket False

0 commit comments

Comments
 (0)