Amazon takes top three spots in Audio Anomaly Detection Challenge

{"value":"This week at Amazon Web Services’ ++[re:Invent 2020](https://reinvent.awsevents.com/)++ conference, Amazon announced ++[Amazon Monitron](https://aws.amazon.com/monitron/)++, an end-to-end machine-monitoring system composed of sensors, a gateway, and a machine learning model that detects anomalies in vibration (structure-borne sound) or temperature and predicts when equipment may require maintenance. \n\nMachine condition monitoring was also the topic of a challenge at the ++[Workshop on the Detection and Classification of Acoustic Scenes and Events (DCASE 2020)](http://dcase.community/workshop2020/)++, in November, in which Amazon took the ++[top three spots](http://dcase.community/challenge2020/task-unsupervised-detection-of-anomalous-sounds-results#systems-ranking)++, out of 117 submissions.\n\nThe challenge was to determine whether the sounds emitted by a machine — such as a fan, pump, or valve — were normal or anomalous. Forty academic and industry teams submitted entries, an average of almost three submissions per team.\n\nIn a pair of papers (++[paper 1](https://www.amazon.science/publications/group-masked-autoencoder-based-density-estimator-for-audio-anomaly-detection)++|++[paper 2](https://www.amazon.science/publications/self-supervised-classification-for-detecting-anomalous-sounds)++) we presented at the workshop, we describe the two different neural-network-based approaches we took in our submissions to the challenge. The first of those papers won the workshop’s ++[best-paper award](http://dcase.community/articles/dcase2020-best-paper-awards)++.\n\n![image.png](https://dev-media.amazoncloud.cn/e77ad92ae8a642378fa22d1c2c474845_image.png)\n\nSpectrograms of audio clips recorded from a normal valve (top) and a faulty valve (bottom). The magnified details show the difference between the normal signal and the anomalous signal.\n\nAuditory machine condition monitoring has been common in industrial settings for several decades. Seasoned maintenance experts can identify problems in the machines they monitor just by listening to them and realizing that “something doesn’t sound right.” But by the time anomalies are audible to the human ear, the underlying problems may already be well advanced.\n\nWith the advent of machine learning and big data, there has been a lot of interest in teaching machines to detect anomalies sooner, to help predict when preventative maintenance might be necessary.\n\n\n#### **Data, labels, and rare failures**\n\n\nIn general, anomaly detection is the problem of identifying abnormal inputs in a stream of inputs. Depending on the available data, there are three different ways to train anomaly detection systems: (i) fully supervised, in which labeled examples of normal and abnormal data are presented; (ii) semi-supervised, in which only normal data is presented; and (iii) unsupervised, in which there are no labels in the data set, and outliers have to be classified automatically. \n\nAnomalies can manifest themselves in different ways. For instance, you can have slow *++[concept drift](https://en.wikipedia.org/wiki/Concept_drift)++* or sudden, instantaneous outliers. Typically, the data is also highly imbalanced — a lot more “normal” examples than “abnormal.”\n\nMachines worth monitoring carefully — especially those that are critical or expensive — are usually also well maintained. 
#### **Data, labels, and rare failures**

In general, anomaly detection is the problem of identifying abnormal inputs in a stream of inputs. Depending on the available data, there are three different ways to train anomaly detection systems: (i) fully supervised, in which labeled examples of normal and abnormal data are presented; (ii) semi-supervised, in which only normal data is presented; and (iii) unsupervised, in which there are no labels in the data set, and outliers have to be classified automatically.

Anomalies can manifest themselves in different ways. For instance, you can have slow *[concept drift](https://en.wikipedia.org/wiki/Concept_drift)* or sudden, instantaneous outliers. Typically, the data is also highly imbalanced — there are many more “normal” examples than “abnormal” ones.

Machines worth monitoring carefully — especially those that are critical or expensive — are usually also well maintained. This means that they rarely fail, and gathering anomalous data from them is challenging and may take many years and a lot of effort.

Additionally, machines operate in different modes and under variable load or performance conditions, and their characteristics can change over time as they age and approach steady state. Some industries’ operational profiles have seasonal variations as well.

All of these factors make anomaly detection challenging in the industrial setting. When implementing an anomaly detection system, one has to depend mostly on “normal” data, gathering additional data over time and eliciting user feedback.

If accurate physical models of machines are available, it may be possible to simulate failures and generate “abnormal” data that way. One can also generate anomalous data by inducing hardware failures in the lab. But one has to be prepared to work with minimal data when a machine is instrumented for the first time (the so-called cold-start problem).

#### **Anomaly detection and our two neural approaches**

The papers we presented at DCASE ([paper 1](https://www.amazon.science/publications/group-masked-autoencoder-based-density-estimator-for-audio-anomaly-detection) | [paper 2](https://www.amazon.science/publications/self-supervised-classification-for-detecting-anomalous-sounds)) describe two different neural-network-based approaches to anomaly detection.

The first approach builds on recent advances in autoregressive neural density estimation — calculating a data distribution for streaming data by trying to predict each new data item on the basis of those that preceded it. As might be expected, such models are very sensitive to the order in which data arrives.

An earlier model, called the masked autoencoder for density estimation (MADE), makes a separate prediction for each feature — each dimension — of the input. With audio signals, however, the dimensions of the input are the energies in different frequency bands, which produce a composite picture of the signal that individual frequencies won’t capture.

We introduce a variation of MADE that bases its predictions on groups of input features — in this case, groups of frequency bands — and which we accordingly call Group MADE.
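To make the group-masking idea concrete, here is a minimal, hypothetical PyTorch sketch of a one-hidden-layer density estimator whose masks ensure that each group of dimensions is predicted only from earlier groups. It is a simplification for illustration (single hidden layer, Gaussian outputs), not the architecture from the paper.

```python
import math
import torch
import torch.nn as nn
import torch.nn.functional as F

def build_group_masks(groups, hidden_size):
    """Build MADE-style masks for a group-autoregressive order. Hidden units
    get 'degrees' in 1..G-1; a connection survives only if it cannot leak a
    group's own values into that group's prediction."""
    G = int(groups.max())
    h_deg = torch.arange(hidden_size) % (G - 1) + 1        # hidden degrees 1..G-1
    mask_in = (h_deg[:, None] >= groups[None, :]).float()  # (hidden, D)
    mask_out = (groups[:, None] > h_deg[None, :]).float()  # (D, hidden)
    return mask_in, mask_out

class GroupMADESketch(nn.Module):
    """Toy Group MADE: each group of input dimensions is predicted (as a
    factorized Gaussian) only from dimensions in strictly earlier groups."""
    def __init__(self, groups, hidden_size=256):
        super().__init__()
        D = groups.numel()
        mask_in, mask_out = build_group_masks(groups, hidden_size)
        self.register_buffer("mask_in", mask_in)
        self.register_buffer("mask_out", mask_out)
        self.hidden = nn.Linear(D, hidden_size)
        self.mu = nn.Linear(hidden_size, D)
        self.logvar = nn.Linear(hidden_size, D)

    def forward(self, x):
        h = torch.relu(F.linear(x, self.hidden.weight * self.mask_in, self.hidden.bias))
        mu = F.linear(h, self.mu.weight * self.mask_out, self.mu.bias)
        logvar = F.linear(h, self.logvar.weight * self.mask_out, self.logvar.bias)
        return mu, logvar

    def anomaly_score(self, x):
        """Negative log-likelihood under the Gaussian model; higher = more anomalous."""
        mu, logvar = self(x)
        nll = 0.5 * (math.log(2 * math.pi) + logvar + (x - mu) ** 2 / logvar.exp())
        return nll.sum(dim=1)

# Example: 64 mel bands split into 8 contiguous groups of 8 bands each
groups = torch.arange(64) // 8 + 1   # group ids 1..8
model = GroupMADESketch(groups, hidden_size=256)
```

The negative log-likelihood doubles as the anomaly score: inputs to which the model assigns low probability are flagged as anomalous.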
In the second paper, we use a self-supervised approach to representation learning, which has recently been successful in solving problems in vision and speech. We believe that we are the first to apply it to audio anomaly detection.

In the absence of anomalies in the training data, we instead trained a network to distinguish multiple instances of machines within a given machine type. We found that the features learned by such a network were sensitive enough to detect subtle, previously unseen anomalies in the evaluation set. We used spectral warping and random mixing to simulate new machine instances in addition to the ones provided in the data set.
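The scoring this enables is simple: a classifier trained to identify which machine instance produced a clip should become less confident about the correct instance when that machine starts behaving abnormally. A hypothetical sketch of that scoring step, assuming a trained instance classifier `model` mapping log-mel features to instance logits (the details are illustrative, not the paper’s exact recipe):

```python
import torch
import torch.nn.functional as F

def anomaly_score(model, features, instance_id):
    """Score a clip by how unlikely the trained instance classifier finds
    the clip's true machine instance; higher scores mean more anomalous.

    features:    tensor of shape (1, n_mels, n_frames), log-mel input
    instance_id: index of the machine instance the clip came from
    """
    model.eval()
    with torch.no_grad():
        logits = model(features)              # (1, n_instances)
        log_probs = F.log_softmax(logits, dim=-1)
    return -log_probs[0, instance_id].item()  # negative log-probability
```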
![image.png](https://dev-media.amazoncloud.cn/9785624c5faa41e9a1648a534a0ac80d_image.png)

Two-dimensional visualizations of two different representations of the toy car sounds in the DCASE data set: the raw spectrograms (left) and the features learned by our self-supervised model (right). The blue samples represent data from a normal machine, the red samples data from an anomalous one. In the raw spectrograms, there is little separation between the normal and anomalous samples, while the learned features separate out the anomalous samples much more clearly.

The DCASE challenge provided data from six different machines: fan, pump, slide rail, valve, toy car, and toy conveyor. DCASE also [provided](http://dcase.community/challenge2020/task-unsupervised-detection-of-anomalous-sounds#audio-dataset) a development data set and a separate evaluation data set. Scoring was calculated using [area under the ROC curve](http://dcase.community/challenge2020/task-unsupervised-detection-of-anomalous-sounds#evaluation) (AUC) and [partial area under the ROC curve](http://dcase.community/challenge2020/task-unsupervised-detection-of-anomalous-sounds#evaluation) (pAUC). The ROC curve plots true-positive rate against false-positive rate, so the area under the curve indicates how well a given system manages that trade-off; partial AUC is the AUC over a small false-positive-rate range, in this case [0, 0.1].
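Both metrics are straightforward to compute with scikit-learn. Note that `roc_auc_score` with `max_fpr` returns the McClish-standardized partial AUC, which may differ slightly from the challenge’s exact pAUC formula:

```python
import numpy as np
from sklearn.metrics import roc_auc_score

y_true = np.array([0, 0, 0, 0, 1, 1])              # 1 = anomalous clip
scores = np.array([0.1, 0.3, 0.2, 0.4, 0.9, 0.7])  # model anomaly scores

auc = roc_auc_score(y_true, scores)                 # full area under the ROC curve
pauc = roc_auc_score(y_true, scores, max_fpr=0.1)   # partial AUC over FPR in [0, 0.1]
print(f"AUC = {auc:.3f}, pAUC = {pauc:.3f}")
```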
The table below shows the accuracies we were able to obtain, both during the challenge and since. We have developed a third approach that helped improve some of these numbers, which we will detail in a future publication.

The challenge ranking method involved two steps, to account for the disparate difficulty levels across machine types. First, machine-specific rankings were assigned to all submissions, based on AUC and pAUC. The submissions were then ranked by the average of their machine-specific ranks. The full leaderboard is available [here](http://dcase.community/challenge2020/task-unsupervised-detection-of-anomalous-sounds-results).
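As an illustration of that two-step ranking (a toy example with made-up numbers, not the organizers’ code), with per-machine scores where higher is better:

```python
import numpy as np
from scipy.stats import rankdata

# rows: machine types; columns: submissions; entries: per-machine metric
# (toy numbers for illustration; higher is better)
scores = np.array([
    [0.91, 0.88, 0.95],   # fan
    [0.80, 0.85, 0.79],   # pump
    [0.70, 0.72, 0.75],   # valve
])

# Step 1: rank submissions separately on each machine (rank 1 = best)
per_machine_ranks = np.stack([rankdata(-row, method="average") for row in scores])

# Step 2: order submissions by their average rank across machines
avg_rank = per_machine_ranks.mean(axis=0)
print(avg_rank)           # [2.33, 2.0, 1.67] -> submission 3 wins
```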
While our models won the challenge using the across-all-machine-types scoring described above, fine-tuning them for specific machine types yielded the results in the last row.

![image.png](https://dev-media.amazoncloud.cn/e08eef910694449096ec751799ff2205_image.png)

We believe that as more industrial machine data is accumulated and curated over the next few years, machine learning and neural-network-based approaches will start making a huge difference in the monitoring and maintenance of machines, and Amazon Web Services will be at the forefront of this revolution.

ABOUT THE AUTHOR

#### **[Arvindh Krishnaswamy](https://www.amazon.science/author/arvindh-krishnaswamy)**

Arvindh Krishnaswamy is a principal scientist with Amazon Web Services.