Amazon, Berkeley release dataset of product images and metadata

海外精选
海外精选的内容汇集了全球优质的亚马逊云科技相关技术内容。同时,内容中提到的“AWS” 是 “Amazon Web Services” 的缩写,在此网站不作为商标展示。
0
0
{"value":"Researchers in Amazon’s Seller Partner Services organization, together with colleagues at the University of California, Berkeley, have publicly released a [massive dataset of product images](https://amazon-berkeley-objects.s3.amazonaws.com/index.html) and associated metadata to support research on product-related information management, information retrieval, and visual understanding.\n\nThe dataset could, for instance, help enable new, more-powerful AI models for [image-based shopping](https://www.amazon.science/blog/how-computer-vision-will-help-amazon-customers-shop-online) or expansion of retailers’ [product graphs](https://www.amazon.science/blog/thewebconf-where-communities-converge-on-questions-of-scale).\n\n“Computer vision is an empirically driven field, and its progress has been shaped by datasets,” says Jitendra Malik, the Arthur J. Chick Professor of Electrical Engineering and Computer Science at Berkeley, whose group helped develop the dataset. “In traditional image or video collections, we have very little information about a specific object — typically just a category label, like ‘chair’. For the objects in ABO, we have attributes, multiple views, and CAD models. This lets us infer much more from an image. We expect this will advance research in multiple areas of computer vision, particularly 3-D inference.”\n\n![下载.gif](https://dev-media.amazoncloud.cn/7807f8f8a52a4eb5bd69e22341c059e0_%E4%B8%8B%E8%BD%BD.gif)\n\nSample exploration of one of the 7,900 3-D product models in the Amazon Berkeley Objects Dataset (ABO).\nCREDIT: IMAGE SOURCE, ABO; ANIMATION BY GUPTA MEDIA\n\nDubbed the Amazon Berkeley Objects Dataset, or ABO, the dataset includes images of 147,702 products, all annotated with metadata such as multilingual title, brand, model, year, product type, dimensions, and material. There are 398,212 static catalogue images; 8,200 images that provide 360° rotations in the plane at 5° intervals (for a total of 72 perspectives per product); and 7,900 fully 3-D product models that can be rotated along any axis and rendered in any 3-D environment under any lighting conditions.\n\n#### **Registry of Open Data on AWS**\n\nExplore other Amazon datasets on the [Registry of Open Data on AWS](https://registry.opendata.aws/tag/amazon.science/), home to more than 268 publicly available datasets, including datasets useful to earth science, life sciences, and sustainability. Some examples of datasets are the [U.S. NIH Sequence Read Archive](https://aws.amazon.com/blogs/industries/nihs-sequence-read-archive-the-worlds-largest-genome-sequence-repository-openly-accessible-on-aws/), the [USGS Landsat program](https://registry.opendata.aws/noaa-gefs-reforecast/), and [NOAA’s Global Ensemble Forecast System](https://registry.opendata.aws/noaa-gefs-reforecast/).\n\nThe dataset is licensed under the Creative Commons Attribution-NonCommercial 4.0 International Public License (CC BY-NC 4.0), which prohibits commercial use of the dataset but is otherwise nonrestrictive.\n\nCollectively — still images, thumbnails of the still images, 360° rotations, and 3-D models — the size of the dataset is almost 300 gigabytes. On the [dataset website](https://amazon-berkeley-objects.s3.amazonaws.com/index.html), researchers can download the entire dataset; browse product images filtered according to product name or type; or learn how to use a version of the dataset hosted by Amazon Web Services without having to download it. \n\n“Data has become the most important component of AI and machine learning,” says Matthieu Guillaumin, a senior applied scientist at Amazon who helped develop the dataset. “During this project, we have been driven by the hope to spark major innovations in product understanding that have the potential to benefit shoppers worldwide.”","render":"<p>Researchers in Amazon’s Seller Partner Services organization, together with colleagues at the University of California, Berkeley, have publicly released a <a href=\"https://amazon-berkeley-objects.s3.amazonaws.com/index.html\" target=\"_blank\">massive dataset of product images</a> and associated metadata to support research on product-related information management, information retrieval, and visual understanding.</p>\n<p>The dataset could, for instance, help enable new, more-powerful AI models for <a href=\"https://www.amazon.science/blog/how-computer-vision-will-help-amazon-customers-shop-online\" target=\"_blank\">image-based shopping</a> or expansion of retailers’ <a href=\"https://www.amazon.science/blog/thewebconf-where-communities-converge-on-questions-of-scale\" target=\"_blank\">product graphs</a>.</p>\n<p>“Computer vision is an empirically driven field, and its progress has been shaped by datasets,” says Jitendra Malik, the Arthur J. Chick Professor of Electrical Engineering and Computer Science at Berkeley, whose group helped develop the dataset. “In traditional image or video collections, we have very little information about a specific object — typically just a category label, like ‘chair’. For the objects in ABO, we have attributes, multiple views, and CAD models. This lets us infer much more from an image. We expect this will advance research in multiple areas of computer vision, particularly 3-D inference.”</p>\n<p><img src=\"https://dev-media.amazoncloud.cn/7807f8f8a52a4eb5bd69e22341c059e0_%E4%B8%8B%E8%BD%BD.gif\" alt=\"下载.gif\" /></p>\n<p>Sample exploration of one of the 7,900 3-D product models in the Amazon Berkeley Objects Dataset (ABO).<br />\nCREDIT: IMAGE SOURCE, ABO; ANIMATION BY GUPTA MEDIA</p>\n<p>Dubbed the Amazon Berkeley Objects Dataset, or ABO, the dataset includes images of 147,702 products, all annotated with metadata such as multilingual title, brand, model, year, product type, dimensions, and material. There are 398,212 static catalogue images; 8,200 images that provide 360° rotations in the plane at 5° intervals (for a total of 72 perspectives per product); and 7,900 fully 3-D product models that can be rotated along any axis and rendered in any 3-D environment under any lighting conditions.</p>\n<h4><a id=\"Registry_of_Open_Data_on_AWS_13\"></a><strong>Registry of Open Data on AWS</strong></h4>\n<p>Explore other Amazon datasets on the <a href=\"https://registry.opendata.aws/tag/amazon.science/\" target=\"_blank\">Registry of Open Data on AWS</a>, home to more than 268 publicly available datasets, including datasets useful to earth science, life sciences, and sustainability. Some examples of datasets are the <a href=\"https://aws.amazon.com/blogs/industries/nihs-sequence-read-archive-the-worlds-largest-genome-sequence-repository-openly-accessible-on-aws/\" target=\"_blank\">U.S. NIH Sequence Read Archive</a>, the <a href=\"https://registry.opendata.aws/noaa-gefs-reforecast/\" target=\"_blank\">USGS Landsat program</a>, and <a href=\"https://registry.opendata.aws/noaa-gefs-reforecast/\" target=\"_blank\">NOAA’s Global Ensemble Forecast System</a>.</p>\n<p>The dataset is licensed under the Creative Commons Attribution-NonCommercial 4.0 International Public License (CC BY-NC 4.0), which prohibits commercial use of the dataset but is otherwise nonrestrictive.</p>\n<p>Collectively — still images, thumbnails of the still images, 360° rotations, and 3-D models — the size of the dataset is almost 300 gigabytes. On the <a href=\"https://amazon-berkeley-objects.s3.amazonaws.com/index.html\" target=\"_blank\">dataset website</a>, researchers can download the entire dataset; browse product images filtered according to product name or type; or learn how to use a version of the dataset hosted by Amazon Web Services without having to download it.</p>\n<p>“Data has become the most important component of AI and machine learning,” says Matthieu Guillaumin, a senior applied scientist at Amazon who helped develop the dataset. “During this project, we have been driven by the hope to spark major innovations in product understanding that have the potential to benefit shoppers worldwide.”</p>\n"}
目录
亚马逊云科技解决方案 基于行业客户应用场景及技术领域的解决方案
联系亚马逊云科技专家
亚马逊云科技解决方案
基于行业客户应用场景及技术领域的解决方案
联系专家
0
目录
关闭