Amazon helps launch workshop on synthetic data generation


We are excited to announce the first Workshop on [Synthetic Data Generation](https://sdg-quality-privacy-bias.github.io/), to be held virtually at [ICLR 2021](https://www.amazon.science/conferences-and-events/iclr-2021) on May 7, 2021.

Synthetic data is a powerful solution to two different problems: data limitations and privacy risks. When labeled data is limited, synthetic data can be used to augment the training data, mitigating overfitting. When privacy is the concern, data curators can share synthetic data instead of real data in a manner that both protects the privacy of users and preserves the utility of the original data.

Although these two scenarios share similar technical challenges, such as quality and fairness, they are often studied separately. Our workshop aims to deepen our understanding of the challenges of synthetic data generation in both scenarios.

![image.png](https://dev-media.amazoncloud.cn/4d1b4692d6074f8fb16a8325bbf87ae7_image.png)

Two Amazon scientists, applied scientist Sergul Aydore (left) and principal scientist Krishnaram Kenthapadi, are among the organizers of the first Workshop on Synthetic Data Generation at this year's ICLR.

The workshop is organized by a team of researchers from academia and industry with expertise in topics such as privacy, fairness, healthcare, and robustness in machine learning. The team includes two Amazon scientists: [Sergul Aydore](https://www.amazon.science/author/sergul-aydore), an applied scientist on the Amazon Web Services external-security-services team, and Krishnaram Kenthapadi, a principal applied scientist on the Amazon Web Services machine learning team. The other organizers are [Haipeng Chen](https://haipeng-chen.github.io/), from Harvard University; [Edward Choi](https://mp2893.com/), from the Korea Advanced Institute of Science and Technology (KAIST); [Jamie Hayes](https://jamiehay.es/), from Google DeepMind; [Mario Fritz](https://cispa.saarland/group/fritz/), from the CISPA Helmholtz Center for Information Security; and [Rachel Cummings](https://sites.gatech.edu/rachel-cummings/), from Columbia University.

Our workshop includes invited talks, contributed talks, poster sessions, and a panel discussion, and it involves a diverse group of researchers and practitioners. We are proud to host the following seven invited talks (in order of appearance):

- Can machine learning revolutionize healthcare? Synthetic data may be the answer, [Mihaela van der Schaar](https://www.vanderschaar-lab.com/), University of Cambridge, the Alan Turing Institute, UCLA
- Generative models for image synthesis, [Jan Kautz](https://jankautz.com/), NVIDIA
- Differentially private synthetic data generations using generative adversarial networks, Jinsung Yoon, Google Cloud AI
- Towards financial synthetic data, [Manuela M. Veloso](http://www.cs.cmu.edu/~mmv/), J. P. Morgan, CMU
- Bias and generalization of deep generative models, [Stefano Ermon](https://cs.stanford.edu/~ermon/), Stanford University
- Generative modeling for music generation, [Sander Dieleman](https://benanne.github.io/), DeepMind
- Ethical considerations of generative AI, [Emily Denton](https://cephaloponderer.com/), Google's Ethical AI team

The workshop features [24 accepted papers](https://sdg-quality-privacy-bias.github.io/papers/), each of which will have an individual breakout session for a poster presentation. Among these papers, the following seven will have oral presentations:

- Synthetic data for model selection, Matan Fintz (Amazon); Alon Shoshan (Technion); Nadav Bhonker (Amazon); Igor Kviatkovsky (Amazon); Gérard Medioni (USC) ([PDF](https://sdg-quality-privacy-bias.github.io/papers/SDG_paper_14.pdf))
- Ensembles of GANs for synthetic training data generation, Gabriel Eilertsen (Linköping University); Apostolia Tsirikoglou (Linköping University); Claes Lundström (Linköping University); Jonas Unger (Linköping University) ([PDF](https://sdg-quality-privacy-bias.github.io/papers/SDG_paper_15.pdf))
- Few-shot learning via tensor hallucination, Michalis M. L. Lazarou (Imperial College London); Tania Stathaki (Imperial College London); Yannis Avrithis (Inria) ([PDF](https://sdg-quality-privacy-bias.github.io/papers/SDG_paper_26.pdf))
- Leveraging public data for practical private query release, Terrance Liu (Carnegie Mellon University); Giuseppe Vietri (University of Minnesota); Thomas Steinke (Google); Jonathan Ullman (Northeastern University); Steven Wu (Carnegie Mellon University) ([PDF](https://sdg-quality-privacy-bias.github.io/papers/SDG_paper_18.pdf))
- FFPDG: Fast, fair and private data generation, Weijie Xu (Amazon); Jinjin Zhao (Amazon); Francis Iannacci (Amazon); Bo Wang (Amazon) ([PDF](https://sdg-quality-privacy-bias.github.io/papers/SDG_paper_26.pdf))
- Overcoming barriers to data sharing with medical image generation: A comprehensive evaluation, August DuMont Schütte (Max Planck Institute for Intelligent Systems); Jürgen Hetzel (University Hospital of Tübingen); Sergios Gatidis (University of Tübingen); Tobias Hepp (Max Planck Institute for Intelligent Systems); Benedikt Dietz (ETH Zurich); Stefan Bauer (Max Planck Institute); Patrick Schwab (ETH Zurich) ([PDF](https://sdg-quality-privacy-bias.github.io/papers/SDG_paper_19.pdf))
- Imperfect imaGANation: Implications of GANs exacerbating biases on facial data, Niharika Jain (Arizona State University); Alberto Olmo (Arizona State University); Sailik Sengupta (Arizona State University); Lydia Manikonda (Rensselaer Polytechnic Institute); Subbarao Kambhampati (Arizona State University) ([PDF](https://sdg-quality-privacy-bias.github.io/papers/SDG_paper_6.pdf))

We will conclude the workshop with a panel discussion with the invited speakers and an award ceremony.

ABOUT THE AUTHOR
#### **[Sergul Aydore](https://www.amazon.science/author/sergul-aydore)**
Sergul Aydore is an applied scientist with Amazon Web Services.
#### **Krishnaram Kenthapadi**
Krishnaram Kenthapadi is a principal scientist with Amazon Web Services.
