Amazon scientists welcome Iceland’s presidential delegation



![download.jpg](https://dev-media.amazoncloud.cn/7ad29e581a6f4dedb7864820cfb33fd6_%E4%B8%8B%E8%BD%BD.jpg)

Top row, center: H. E. Guðni Th. Jóhannesson, president of Iceland. Top row, from left of president: Robin Dautricourt, principal product manager, Amazon Polly; Jack FitzGerald, senior applied scientist, Amazon Alexa AI; Nikko Ström, distinguished scientist and VP, Amazon Alexa AI; Michele Butti, director, Amazon Alexa International. Top row, from right of president: Halldór Benjamín Þorbergsson, CEO of the Confederation of Icelandic Enterprise; Björgvin Ingi Ólafsson, member of the Board of Almannarómur, the Icelandic Center for Language Technology; Jón Guðnason, associate professor of signal processing and language technology, Reykjavík University. Middle row, from left of president: Nikulás Hannigan, Iceland’s trade commissioner to North America and consul general in New York; Stefanía G. Halldórsdóttir, chairman of the Board of Almannarómur. Middle row, from right of president: Susan Pointer, VP, public policy, Amazon; Jóhanna Vigdís Guðmundsdóttir, CEO of Almannarómur; Vilhjálmur Þorsteinsson, founder and CEO of Miðeind; Kristrún Heiða Hauksdóttir, specialist at the Ministry for Cultural and Business Affairs. Bottom row: Lilja Dögg Alfreðsdóttir, minister of cultural and business affairs; Nikhil Sharma, senior manager of product management, Amazon Text-to-Speech.

Recently at our Seattle headquarters, Amazon had the pleasure of hosting Iceland’s president, H. E. Guðni Th. Jóhannesson, along with a delegation of Icelandic government officials, business leaders, and academics. It was truly an honor to meet with them.

The president’s visit to the region was part of a broader mission to preserve the Icelandic language in the digital age by integrating it into all forms of technology. In this post, we’d like to highlight some of the innovative work Iceland has spearheaded to accelerate the digital integration of Icelandic. We have found the resulting resources to be strong, collaborative tools, and we hope others do, too.

Since 2019, Iceland’s government has been funding a [five-year language technology program](https://aclanthology.org/2020.lrec-1.418.pdf) for Icelandic, which has produced an impressive set of artifacts relevant to text-to-speech, speech recognition, and natural-language processing. These include parallel datasets, pronunciation lexicons, text normalization mappings, speech data, treebanks, tokenizers, named-entity recognizers, and modeling recipes. Tools of this kind have important applications in all languages, particularly those with relatively small amounts of data for training machine learning models.

The program’s strategy is multifaceted, targeting everything from fundamental research to customer-facing products. Its five core research areas are language resources, speech recognition, speech synthesis, machine translation, and spelling and grammar checking.

A list of selected resources appears below. We hope that you will use these resources in your own work, and we encourage you to keep an eye on the program’s progress.

Additionally, we’d like to highlight some work that Amazon has been doing for language expansion and low-data natural-language processing.

We recently launched the [MASSIVE](https://www.amazon.science/blog/amazon-releases-51-language-dataset-for-language-understanding) dataset, competition, and workshop, which will help advance the state of the art for multilingual natural-language understanding, for Icelandic and 50 other languages.

Amazon Translate has expanded into [75 languages](https://aws.amazon.com/about-aws/whats-new/2021/11/amazon-translate-supports-four-additional-languages/), and Amazon Polly supports [33 languages](https://aws.amazon.com/polly/features/?nc=sn&loc=3), both including Icelandic. Language expansion and support is a consistent effort across many Amazon services and products.

We’ve also been busy in core scientific research, including research in [cross-lingual transfer learning](https://www.amazon.science/publications/exploring-cross-lingual-transfer-learning-with-unsupervised-machine-translation), [zero-shot transfer learning](https://www.amazon.science/publications/zero-shot-spoken-language-understanding-for-english-hindi-an-easy-victory-against-word-order-divergence), [multilingual training data generation](https://www.amazon.science/publications/multilingual-paraphrase-generation-for-bootstrapping-new-features-in-task-oriented-dialog-systems), [adversarial advertisement detection](https://www.amazon.science/publications/training-language-models-under-resource-constraints-for-adversarial-advertisement-detection), [text normalization for new languages](https://www.amazon.science/blog/text-normalization-with-only-3-as-much-training-data) in text-to-speech systems, and [continuous improvement with machine translation](https://www.amazon.science/publications/continuous-model-improvement-for-language-understanding-with-machine-translation). These are just a few examples. If you’d like to join us in tackling similar challenges, please visit our [careers page](https://www.amazon.science/careers).

The prevailing sentiment during our meeting with the Icelandic presidential delegation was one of optimism: optimism that developers everywhere can leverage recent and upcoming advances in artificial intelligence to accelerate the integration of Icelandic and other languages into all types of technology.

Keep building.

#### **Resources**

Here are some resources provided to us by the Icelandic delegation that you may find useful:

- [An overview of the program and past work](https://aclanthology.org/2020.lrec-1.418.pdf).
- [Parallel text-speech database for TTS (Talrómur)](https://repository.clarin.is/repository/xmlui/handle/20.500.12537/104): The first part of the database (Talrómur 1) consists of 220 hours of studio-quality recordings from four female and four male voices. Each voice donor recorded between 10 and 30 hours of data, which should be sufficient to build a voice that sounds like that donor. The data is available under a Creative Commons 4.0 BY license.
- [Talrómur 2](http://hdl.handle.net/20.500.12537/167): 80 hours of studio-quality recordings from 20 female and 20 male voices. Each voice donor recorded approximately two hours of data. While two hours might not be enough to create a voice from scratch based on a specific voice donor, it should be possible to combine the voices in this dataset (and, indeed, in Talrómur 1) to create a voice that is a unique mix of the voices in the dataset. The data is available under a Creative Commons 4.0 BY license.
- [Icelandic pronunciation dictionary](https://repository.clarin.is/repository/xmlui/handle/20.500.12537/154): A manually verified pronunciation lexicon containing almost 50,000 unique word forms transcribed in four pronunciation variants, often including a clear and a less formal transcription (reading pronunciation vs. casual-speech pronunciation). The repository contains the transcription rules and guidelines followed in the project. The dictionary is available under a Creative Commons 4.0 BY license.
- [Text normalization corpus](https://repository.clarin.is/repository/xmlui/handle/20.500.12537/158): A corpus of 40,000 sentences, manually normalized for TTS (an example of a normalization task in TTS is converting “$30” to “thirty dollars”).
- [Text preprocessing for TTS](https://github.com/grammatek/tts-frontend): A text-preprocessing pipeline connecting standalone modules for text cleaning, text normalization, phrasing, and grapheme-to-phoneme (g2p) conversion. The front-end pipeline and all submodules are available under an Apache 2.0 license.
- [Recipes for Icelandic TTS](https://github.com/cadia-lvl/unit-selection-festival): Open-source TTS recipes for Icelandic have been made available as part of the Language Technology Programme for Icelandic (LTPI). A traditional unit selection recipe implemented in Festival is available here under an Apache 2.0 license.
- [Neural-TTS recipe](https://github.com/cadia-lvl/FastSpeech2): Implemented in FastSpeech 2. Available under an Apache 2.0 license.
- [Talrómur 1 baseline models, train/test splits, and alignments](http://hdl.handle.net/20.500.12537/201)
- **Parallel text-speech database for ASR (Samrómur)**: The Samrómur crowdsourcing platform is derived from the Mozilla Common Voice project. It is based on read prompts from volunteers and totals over 2,300 hours of data. The crowdsourcing statistics can be seen here. A concurrent verification effort has led to publications (under Creative Commons 4.0 BY licenses) that can, for example, be found here. A similar dataset of 152 hours of adult voices was collected around 2011 and is available here.
- [Parliamentary speech data](https://catalog.ldc.upenn.edu/LDC2021S01): 542 hours of clean and verified speeches from the Icelandic parliament.

#### **Other speech databases**

- [193 hours of television and radio speech data](http://hdl.handle.net/20.500.12537/193)
- [21 hours of transcribed conversations](http://hdl.handle.net/20.500.12537/187)
- [51 hours of transcribed university lectures](http://hdl.handle.net/20.500.12537/171)
- [20 hours of read queries](http://hdl.handle.net/20.500.12537/180)
- [131 hours of children’s speech](http://hdl.handle.net/20.500.12537/185)

#### **Resources for ASR language modeling**

- [The Icelandic Gigaword Corpus](https://repository.clarin.is/repository/xmlui/handle/20.500.12537/33)

#### **Other tools and recipes for ASR**

- [Automatic punctuator for Icelandic](https://github.com/cadia-lvl/punctuation-prediction)
- [Open-source Kaldi recipes using Samrómur](https://github.com/cadia-lvl/samromur-asr)

ABOUT THE AUTHORS

#### **[Jack FitzGerald](https://www.amazon.science/author/jack-g-m-fitzgerald)**

Jack G. M. FitzGerald is a senior applied scientist in Alexa AI’s Natural Understanding group.

#### **[Nikko Ström](https://www.amazon.science/author/nikko-strom)**

Nikko Ström is a vice president and distinguished scientist in the Alexa AI organization.
