Extract insights from SAP ERP with no-code ML solutions with Amazon AppFlow and Amazon SageMaker Canvas

{"value":"\nCustomers in industries like consumer packaged goods, manufacturing, and retail are always looking for ways to empower their operational processes by enriching them with insights and analytics generated from data. Tasks like sales forecasting directly affect operations such as raw material planning, procurement, manufacturing, distribution, and inbound/outbound logistics, and they can have many levels of impact, from a single warehouse all the way to large-scale production facilities.\n\nSales representatives and managers use historical sales data to make informed predictions about future sales trends. Customers use SAP ERP Central Component (ECC) to manage planning for the manufacturing, sale, and distribution of goods. The sales and distribution (SD) module within SAP ECC helps manage sales orders. SAP systems are the primary source of historical sales data.\n\nSales representatives and managers have the domain knowledge and in-depth understanding of their sales data. However, they lack the data science and programming skills to create machine learning (ML) models that can generate sales forecasts. They seek intuitive, simple-to-use tools to create ML models without writing a single line of code.\n\nTo help organizations achieve the agility and effectiveness that business analysts seek, we [introduced](https://aws.amazon.com/blogs/aws/announcing-amazon-sagemaker-canvas-a-visual-no-code-machine-learning-capability-for-business-analysts/) [Amazon SageMaker Canvas](https://aws.amazon.com/sagemaker/canvas/), a no-code ML solution that helps you accelerate delivery of ML solutions down to hours or days. 
SageMaker Canvas enables analysts to easily use available data in data lakes, data warehouses, and operational data stores; build ML models; and use them to make predictions interactively and for batch scoring on bulk datasets, all without writing a single line of code.\n\nIn this post, we show how to bring sales order data from SAP ECC into Amazon S3 and generate sales forecasts using an ML model built with SageMaker Canvas.\n\n#### **Solution overview**\n\nTo generate sales forecasts using SAP sales data, we need the collaboration of two personas: data engineers and business analysts (sales representatives and managers). Data engineers are responsible for configuring the data export from the SAP system to [Amazon Simple Storage Service](http://aws.amazon.com/s3) (Amazon S3) using [Amazon AppFlow](https://aws.amazon.com/appflow/), which business analysts can then run either on demand or automatically (schedule-based) to refresh SAP data in the S3 bucket. Business analysts are then responsible for generating forecasts with the exported data using SageMaker Canvas. The following diagram illustrates this workflow.\n\n![image.png](https://dev-media.amazoncloud.cn/968457e7e8b44dceb4b20e7cb88d0f35_image.png)\n\nFor this post, we use SAP [NetWeaver Enterprise Procurement Model](https://help.sap.com/viewer/a602ff71a47c441bb3000504ec938fea/7.51.13/en-US/57e9f59831e94311852a2af18ab733b5.html) (EPM) for the sample data. EPM is generally used for demonstration and testing purposes in SAP. It uses a common business process model and follows the business object (BO) paradigm to support well-defined business logic. 
We used the SAP transaction SEPM_DG (data generator) to generate around 80,000 historical sales orders and created a HANA CDS view to aggregate the data by product ID, sales date, and city, as shown in the following code:\n\n```\n@AbapCatalog.sqlViewName: 'ZCDS_EPM_VIEW'\n@AbapCatalog.compiler.compareFilter: true\n@AbapCatalog.preserveKey: true\n@AccessControl.authorizationCheck: #CHECK\n@EndUserText.label: 'Sagemaker canvas sales order'\n@OData.publish: true \ndefine view ZCDS_EPM as select from epm_v_sales_data as sd\ninner join epm_v_bp as bp\n on sd.bp_id = bp.bp_id {\n key sd.product_id as productid,\n bp.city,\n concat( cast(\n Concat(\n Concat(\n Concat(substring(cast (sd.created_at as abap.char( 30 )), 1, 4), '-'),\n Concat(substring(cast (sd.created_at as abap.char( 30 )), 5, 2), '-')\n ),\n Substring(cast (sd.created_at as abap.char( 30 )), 7, 2)\n )\n as char10 preserving type),' 00:00:00') as saledate,\n cast(sum(sd.gross_amount) as abap.dec( 15, 3 )) as totalsales \n}\ngroup by sd.product_id,sd.created_at, bp.city\n```\nIn the next section, we expose this view using SAP OData services as an ABAP structure, which allows us to extract the data with Amazon AppFlow.\n\nThe following table shows representative historical sales data from SAP, which we use in this post.\n\n![image.png](1)\n\nThe data file contains historical sales data at daily frequency. It has four columns (```productid```, ```saledate```, ```city```, and ```totalsales```). 
We use SageMaker Canvas to build an ML model that is used to forecast ```totalsales``` for ```productid``` in a particular city.\n\nThis post has been organized to show the activities and responsibilities for both data engineers and business analysts to generate product sales forecasts.\n\n#### **Data engineer: Extract, transform, and load the dataset from SAP to Amazon S3 with Amazon AppFlow**\n\nThe first task you perform as a data engineer is to run an extract, transform, and load (ETL) job on historical sales data from SAP ECC to an S3 bucket, which the business analyst uses as the source dataset for their forecasting model. For this, we use Amazon AppFlow, because it provides an out-of-the-box [SAP OData Connector](https://docs.aws.amazon.com/appflow/latest/userguide/sapodata.html) for ETL (as shown in the following diagram), with a simple UI to set up everything needed to configure the connection from the SAP ECC to the S3 bucket.\n\n![image.png](https://dev-media.amazoncloud.cn/053ad7ec9d894ad69fe0dae9a88b36c4_image.png)\n\n#### **Prerequisites**\n\nThe following are requirements to integrate Amazon AppFlow with SAP:\n\n- SAP NetWeaver Stack version 7.40 SP02 or above\n- Catalog service (OData v2.0) enabled in SAP Gateway for service discovery\n- Support for client-side pagination and query options for SAP OData Service\n- HTTPS-enabled connection to SAP\n\n#### **Authentication**\nAmazon AppFlow supports two authentication mechanisms to connect to SAP:\n\n- Basic – Authenticates using SAP OData user name and password.\n- OAuth 2.0 – Uses OAuth 2.0 configuration with an identity provider. OAuth 2.0 must be enabled for OData v2.0 services.\n\n#### **Connection**\nAmazon AppFlow can connect to SAP ECC using a public SAP OData interface or a private connection. A private connection improves data privacy and security by transferring data through the private AWS network instead of the public internet. 
A private connection uses the VPC endpoint service for the SAP OData instance running in a VPC. The VPC endpoint service must have the Amazon AppFlow service principal ```appflow.amazonaws.com``` as an allowed principal and must be available in more than 50% of the Availability Zones in an AWS Region.\n\n#### **Set up a flow in Amazon AppFlow**\n\nWe configure a new flow in Amazon AppFlow to run an ETL job on data from SAP to an S3 bucket. This flow allows for configuration of the SAP OData Connector as source, S3 bucket as destination, OData object selection, data mapping, data validation, and data filtering.\n\n1. Configure the SAP OData Connector as a data source by providing the following information:\na. Application host URL\nb. Application service path (catalog path)\nc. Port number\nd. Client number\ne. Logon language\nf. Connection type (private link or public)\ng. Authentication mode\nh. Connection name for the configuration\n\n![image.png](https://dev-media.amazoncloud.cn/e91625935c664aa1ac39d50260c471d9_image.png)\n\n2. After you configure the source, choose the OData object and subobject for the sales orders.\nGenerally, sales data from SAP is exported in full at a certain frequency, such as monthly or quarterly. For this post, choose the subobject option for the full-size export.\n\n![image.png](https://dev-media.amazoncloud.cn/7e6ecb158b0a4238a7c22fcb0e5c8276_image.png)\n\n3. Choose the S3 bucket as the destination.\nThe flow exports data to this bucket.\n\n![image.png](https://dev-media.amazoncloud.cn/68fdba9feb9c4cfaa3828a80f8cd0bff_image.png)\n\n4. For **Data format preference**, select **CSV format**.\n5. For **Data transfer preference**, select **Aggregate all records**.\n6. For **Filename preference**, select **Add a timestamp to the file name**.\n7. 
For **Folder structure preference**, select **No timestamped folder**.\nThe **record aggregation configuration** exports the full-size sales data from SAP combined in a single file. The file is written to a single folder (named after the flow) within the S3 bucket, and its name ends with a timestamp in the YYYY-MM-DDTHH:mm:ss format. SageMaker Canvas imports data from this single file for model training and forecasting.\n\n![image.png](https://dev-media.amazoncloud.cn/2301826da2b3433a86df6756cf87faf3_image.png)\n\n8. Configure data mapping and validations to map the source data fields to destination data fields, and enable data validation rules as required.\n\n![image.png](https://dev-media.amazoncloud.cn/3ec7a96865b6459d94c30afb091248d5_image.png)\n\n9. You can also configure data filtering conditions to filter out specific records if your requirements demand it.\n\n![image.png](https://dev-media.amazoncloud.cn/a206d7f27992436a9eae8155e4090deb_image.png)\n\n10. Configure your flow trigger to decide whether the flow runs manually on demand or automatically based on a schedule.\nWhen configured for a schedule, the frequency is based on how frequently the forecast needs to be generated (generally monthly, quarterly, or half-yearly).\n\n![image.png](https://dev-media.amazoncloud.cn/8056730e8115429f96f06ac61ccde0a9_image.png)\n\nAfter the flow is configured, the business analysts can run it on demand or based on the schedule to perform an ETL job on the sales order data from SAP to an S3 bucket.\n\n11. In addition to the Amazon AppFlow configuration, the data engineers also need to configure an [AWS Identity and Access Management](http://aws.amazon.com/iam) (IAM) role for SageMaker Canvas so that it can access other AWS services. 
For instructions, refer to [Give your users permissions to perform time series forecasting](https://docs.aws.amazon.com/sagemaker/latest/dg/canvas-set-up-forecast.html).\n\n#### **Business analyst: Use the historical sales data to train a forecasting model**\n\nLet’s switch gears and move to the business analyst side. As a business analyst, you’re looking for a visual, point-and-click service that makes it easy to build ML models and generate accurate predictions without writing a single line of code or having ML expertise. SageMaker Canvas fits the requirement as a no-code ML solution.\n\nFirst, make sure that your IAM role is configured in such a way that SageMaker Canvas can access other AWS services. For more information, refer to [Give your users permissions to perform time series forecasting](https://docs.aws.amazon.com/en_jp/sagemaker/latest/dg/canvas-set-up-forecast.html), or ask your Cloud Engineering team for help.\n\nWhen the data engineer is done setting up the Amazon AppFlow-based ETL configuration, the historical sales data is available for you in an S3 bucket.\n\n![image.png](https://dev-media.amazoncloud.cn/aad3f3796bf1490dba548d5193ac6aed_image.png)\n\nYou’re now ready to train a model with SageMaker Canvas! This typically involves four steps: importing data into the service, configuring the model training by selecting the appropriate model type, training the model, and finally generating forecasts using the model.\n\n#### **Import data in SageMaker Canvas**\n\nFirst, launch the SageMaker Canvas app from the [Amazon SageMaker](https://aws.amazon.com/sagemaker/) console or from your single sign-on access. If you don’t know how to do that, contact your administrator so that they can guide you through the process of setting up SageMaker Canvas. Make sure that you access the service in the same Region as the S3 bucket containing the historical dataset from SAP. 
You should see a screen like the following.\n\n![image.png](https://dev-media.amazoncloud.cn/1d861bad474c49d584c4e4f68a942e36_image.png)\n\nThen complete the following steps:\n\n1. In SageMaker Canvas, choose **Datasets** in the navigation pane.\n2. Choose **Import** to start importing data from the S3 bucket.\n\n![image.png](https://dev-media.amazoncloud.cn/57f6d011faf542028253038fdaee2583_image.png)\n\n3. On the import screen, choose the data file or object from the S3 bucket to import the training data.\n\n![image.png](https://dev-media.amazoncloud.cn/1313b7cdc2d343c3a723053a08079d87_image.png)\n\nYou can import multiple datasets in SageMaker Canvas. It also supports creating joins between the datasets by choosing **Join data**, which is particularly useful when the training data is spread across multiple files.\n\n**Configure and train the model**\n\nAfter you import the data, complete the following steps:\n\n1. Choose **Models** in the navigation pane.\n2. Choose **New model** to start configuration for training the forecast model.\n\n![image.png](https://dev-media.amazoncloud.cn/d5cc89ded78349399fd639c2ade4c7b4_image.png)\n\n3. For the new model, give it a suitable name, such as ```product_sales_forecast_model```.\n4. Select the sales dataset and choose **Select dataset**.\n\n![image.png](https://dev-media.amazoncloud.cn/3f53ffba7eef47d3aae49f3e0c8eb34e_image.png)\n\nAfter the dataset is selected, you can see data statistics and configure the model training on the **Build** tab.\n\n![image.png](https://dev-media.amazoncloud.cn/c1fd350d83d54500b26b0f31f52c3e7e_image.png)\n\n5. Select **totalsales** as the target column for the prediction.\nYou can see **Time series forecasting** is automatically selected as the model type.\n6. Choose **Configure**.\n\n![image.png](https://dev-media.amazoncloud.cn/2027beb710154f89b2a4f93f4cac59b3_image.png)\n\n7. In the **Time series forecasting** configuration section, choose **productid** for **Item ID column**.\n8. 
Choose **city** for **Group column**.\n9. Choose **saledate** for **Time stamp column**.\n10. For **Days**, enter ```120```.\n11. Choose **Save**.\nThis configures the model to forecast ```totalsales``` for 120 days, using ```saledate``` as the time axis of the historical data. The forecasts can be queried by ```productid``` and ```city```.\n\n![image.png](https://dev-media.amazoncloud.cn/0c86b0fa08ef45ada431d528903ea637_image.png)\n\n12. When the model training configuration is complete, choose **Standard Build** to start the model training.\n\n\nThe **Preview model** option is not available for the time series forecasting model type. You can review the estimated time for the model training on the **Analyze** tab.\n\n![image.png](https://dev-media.amazoncloud.cn/632c7dd3291947afbfdf531a9d8337fc_image.png)\n\nModel training might take 1–4 hours to complete, depending on the data size. When the model is ready, you can use it to generate the forecast.\n\n#### **Generate a forecast**\n\nWhen the model training is complete, the **Analyze** tab shows the prediction accuracy of the model. For instance, in this example, it shows prediction accuracy as 92.87%.\n\n![image.png](https://dev-media.amazoncloud.cn/84920d136dd9453aa8acdbefc9c100f4_image.png)\n\nThe forecast is generated on the **Predict** tab. You can generate forecasts for all the items or a selected single item. The tab also shows the date range for which the forecast can be generated.\n\n![image.png](https://dev-media.amazoncloud.cn/4cc3cd7aa08f4c82a8536202e488db08_image.png)\n\nAs an example, choose the **Single item** option. Select **P-2** for **Item** and **Quito** for **Group** to generate a prediction for product P-2 for city Quito for the date range 2017-08-15 00:00:00 through 2017-12-13 00:00:00.\n\n![image.png](https://dev-media.amazoncloud.cn/e3f0742b74904dcd97036761d0812e31_image.png)\n\nThe generated forecast shows the average forecast as well as the upper and lower bound of the forecast. 
The forecast bounds help you choose between an aggressive and a balanced approach to handling the forecast.\n\nYou can also download the generated forecast as a CSV file or image. The generated forecast CSV file is generally used to work offline with the forecast data.\n\n![image.png](https://dev-media.amazoncloud.cn/94816028493943a7bbda0857733557c6_image.png)\n\nThe forecast is now generated for the time series data. When a new baseline of data becomes available for the forecast, you can change the dataset in SageMaker Canvas to retrain the forecast model using the new baseline.\n\n![image.png](https://dev-media.amazoncloud.cn/77cea97dd194441596a7863a02010043_image.png)\n\nYou can retrain the model whenever the training data changes.\n\n#### **Clean up**\n\nTo avoid incurring future [session charges](https://aws.amazon.com/sagemaker/canvas/pricing), log out of SageMaker Canvas.\n\n![image.png](https://dev-media.amazoncloud.cn/26bfea6caa7e47918c7e16307d2f9d3c_image.png)\n\n#### **Conclusion**\n\nIn this post, you learned how the Amazon AppFlow SAP OData Connector exports sales order data from the SAP system into an S3 bucket and then how to use SageMaker Canvas to build a model for forecasting.\n\nYou can use SageMaker Canvas for any SAP time series data scenario, such as expense or revenue prediction. The entire forecast generation process is configuration driven. Sales managers and representatives can generate sales forecasts every month or quarter with a refreshed set of data in a fast, straightforward, and intuitive way without writing a single line of code. 
This helps improve productivity and enables quick planning and decisions.\n\nTo get started, learn more about SageMaker Canvas and Amazon AppFlow using the following resources:\n\n- [Amazon SageMaker Canvas Developer Guide](https://docs.aws.amazon.com/sagemaker/latest/dg/canvas.html)\n- [Announcing Amazon SageMaker Canvas – a Visual, No Code Machine Learning Capability for Business Analysts](https://aws.amazon.com/blogs/aws/announcing-amazon-sagemaker-canvas-a-visual-no-code-machine-learning-capability-for-business-analysts/)\n- [Extract data from SAP ERP and BW with Amazon AppFlow](https://aws.amazon.com/blogs/awsforsap/extract-data-from-sap-erp-and-bw-with-amazon-appflow/)\n- [SAP OData Connector configuration](https://docs.aws.amazon.com/appflow/latest/userguide/sapodata.html)\n\n#### **About the Authors**\n\n![image.png](https://dev-media.amazoncloud.cn/2f6c7654aa474d518319843117e58242_image.png)\n\n**Brajendra Singh** is a solutions architect at Amazon Web Services working with enterprise customers. He has a strong developer background and is a keen enthusiast for data and machine learning solutions.\n\n![image.png](https://dev-media.amazoncloud.cn/11acf9b344da4afc98b3d782f3501374_image.png)\n\n**Davide Gallitelli** is a Specialist Solutions Architect for AI/ML in the EMEA region. He is based in Brussels and works closely with customers throughout Benelux. He has been a developer since he was very young, starting to code at the age of 7. He started learning AI/ML at university, and has been in love with it ever since.\n\n\n","render":"<p>Customers in industries like consumer packaged goods, manufacturing, and retail are always looking for ways to empower their operational processes by enriching them with insights and analytics generated from data. 
Tasks like sales forecasting directly affect operations such as raw material planning, procurement, manufacturing, distribution, and inbound/outbound logistics, and they can have many levels of impact, from a single warehouse all the way to large-scale production facilities.</p>\n<p>Sales representatives and managers use historical sales data to make informed predictions about future sales trends. Customers use SAP ERP Central Component (ECC) to manage planning for the manufacturing, sale, and distribution of goods. The sales and distribution (SD) module within SAP ECC helps manage sales orders. SAP systems are the primary source of historical sales data.</p>\n<p>Sales representatives and managers have the domain knowledge and in-depth understanding of their sales data. However, they lack the data science and programming skills to create machine learning (ML) models that can generate sales forecasts. They seek intuitive, simple-to-use tools to create ML models without writing a single line of code.</p>\n<p>To help organizations achieve the agility and effectiveness that business analysts seek, we <a href=\"https://aws.amazon.com/blogs/aws/announcing-amazon-sagemaker-canvas-a-visual-no-code-machine-learning-capability-for-business-analysts/\" target=\"_blank\">introduced</a> <a href=\"https://aws.amazon.com/sagemaker/canvas/\" target=\"_blank\">Amazon SageMaker Canvas</a>, a no-code ML solution that helps you accelerate delivery of ML solutions down to hours or days. 
SageMaker Canvas enables analysts to easily use available data in data lakes, data warehouses, and operational data stores; build ML models; and use them to make predictions interactively and for batch scoring on bulk datasets, all without writing a single line of code.</p>\n<p>In this post, we show how to bring sales order data from SAP ECC into Amazon S3 and generate sales forecasts using an ML model built with SageMaker Canvas.</p>\n<h4><a id=\"Solution_overview_11\"></a><strong>Solution overview</strong></h4>\n<p>To generate sales forecasts using SAP sales data, we need the collaboration of two personas: data engineers and business analysts (sales representatives and managers). Data engineers are responsible for configuring the data export from the SAP system to <a href=\"http://aws.amazon.com/s3\" target=\"_blank\">Amazon Simple Storage Service</a> (Amazon S3) using <a href=\"https://aws.amazon.com/appflow/\" target=\"_blank\">Amazon AppFlow</a>, which business analysts can then run either on demand or automatically (schedule-based) to refresh SAP data in the S3 bucket. Business analysts are then responsible for generating forecasts with the exported data using SageMaker Canvas. The following diagram illustrates this workflow.</p>\n<p><img src=\"https://dev-media.amazoncloud.cn/968457e7e8b44dceb4b20e7cb88d0f35_image.png\" alt=\"image.png\" /></p>\n<p>For this post, we use SAP <a href=\"https://help.sap.com/viewer/a602ff71a47c441bb3000504ec938fea/7.51.13/en-US/57e9f59831e94311852a2af18ab733b5.html\" target=\"_blank\">NetWeaver Enterprise Procurement Model</a> (EPM) for the sample data. EPM is generally used for demonstration and testing purposes in SAP. It uses a common business process model and follows the business object (BO) paradigm to support well-defined business logic. 
We used the SAP transaction SEPM_DG (data generator) to generate around 80,000 historical sales orders and created a HANA CDS view to aggregate the data by product ID, sales date, and city, as shown in the following code:</p>\n<pre><code class=\"lang-\">@AbapCatalog.sqlViewName: 'ZCDS_EPM_VIEW'\n@AbapCatalog.compiler.compareFilter: true\n@AbapCatalog.preserveKey: true\n@AccessControl.authorizationCheck: #CHECK\n@EndUserText.label: 'Sagemaker canvas sales order'\n@OData.publish: true \ndefine view ZCDS_EPM as select from epm_v_sales_data as sd\ninner join epm_v_bp as bp\n on sd.bp_id = bp.bp_id {\n key sd.product_id as productid,\n bp.city,\n concat( cast(\n Concat(\n Concat(\n Concat(substring(cast (sd.created_at as abap.char( 30 )), 1, 4), '-'),\n Concat(substring(cast (sd.created_at as abap.char( 30 )), 5, 2), '-')\n ),\n Substring(cast (sd.created_at as abap.char( 30 )), 7, 2)\n )\n as char10 preserving type),' 00:00:00') as saledate,\n cast(sum(sd.gross_amount) as abap.dec( 15, 3 )) as totalsales \n}\ngroup by sd.product_id,sd.created_at, bp.city\n</code></pre>\n<p>In the next section, we expose this view using SAP OData services as an ABAP structure, which allows us to extract the data with Amazon AppFlow.</p>\n<p>The following table shows representative historical sales data from SAP, which we use in this post.</p>\n<p><img src=\"\" alt=\"image.png\" rel=\"1\" /></p>\n<p>The data file contains historical sales data at daily frequency. It has four columns (<code>productid</code>, <code>saledate</code>, <code>city</code>, and <code>totalsales</code>). 
We use SageMaker Canvas to build an ML model that is used to forecast <code>totalsales</code> for <code>productid</code> in a particular city.</p>\n<p>This post has been organized to show the activities and responsibilities for both data engineers and business analysts to generate product sales forecasts.</p>\n<h4><a id=\"Data_engineer_Extract_transform_and_load_the_dataset_from_SAP_to_Amazon_S3_with_Amazon_AppFlow_54\"></a><strong>Data engineer: Extract, transform, and load the dataset from SAP to Amazon S3 with Amazon AppFlow</strong></h4>\n<p>The first task you perform as a data engineer is to run an extract, transform, and load (ETL) job on historical sales data from SAP ECC to an S3 bucket, which the business analyst uses as the source dataset for their forecasting model. For this, we use Amazon AppFlow, because it provides an out-of-the-box <a href=\"https://docs.aws.amazon.com/appflow/latest/userguide/sapodata.html\" target=\"_blank\">SAP OData Connector</a> for ETL (as shown in the following diagram), with a simple UI to set up everything needed to configure the connection from the SAP ECC to the S3 bucket.</p>\n<p><img src=\"https://dev-media.amazoncloud.cn/053ad7ec9d894ad69fe0dae9a88b36c4_image.png\" alt=\"image.png\" /></p>\n<h4><a id=\"Prerequisites_60\"></a><strong>Prerequisites</strong></h4>\n<p>The following are requirements to integrate Amazon AppFlow with SAP:</p>\n<ul>\n<li>SAP NetWeaver Stack version 7.40 SP02 or above</li>\n<li>Catalog service (OData v2.0) enabled in SAP Gateway for service discovery</li>\n<li>Support for client-side pagination and query options for SAP OData Service</li>\n<li>HTTPS-enabled connection to SAP</li>\n</ul>\n<h4><a id=\"Authentication_69\"></a><strong>Authentication</strong></h4>\n<p>Amazon AppFlow supports two authentication mechanisms to connect to SAP:</p>\n<ul>\n<li>Basic – Authenticates using SAP OData user name and password.</li>\n<li>OAuth 2.0 – Uses OAuth 2.0 configuration with an identity provider. 
OAuth 2.0 must be enabled for OData v2.0 services.</li>\n</ul>\n<h4><a id=\"Connection_75\"></a><strong>Connection</strong></h4>\n<p>Amazon AppFlow can connect to SAP ECC using a public SAP OData interface or a private connection. A private connection improves data privacy and security by transferring data through the private AWS network instead of the public internet. A private connection uses the VPC endpoint service for the SAP OData instance running in a VPC. The VPC endpoint service must have the Amazon AppFlow service principal <code>appflow.amazonaws.com</code> as an allowed principal and must be available in more than 50% of the Availability Zones in an AWS Region.</p>\n<h4><a id=\"Set_up_a_flow_in_Amazon_AppFlow_78\"></a><strong>Set up a flow in Amazon AppFlow</strong></h4>\n<p>We configure a new flow in Amazon AppFlow to run an ETL job on data from SAP to an S3 bucket. This flow allows for configuration of the SAP OData Connector as source, S3 bucket as destination, OData object selection, data mapping, data validation, and data filtering.</p>\n<ol>\n<li>Configure the SAP OData Connector as a data source by providing the following information:<br />\na. Application host URL<br />\nb. Application service path (catalog path)<br />\nc. Port number<br />\nd. Client number<br />\ne. Logon language<br />\nf. Connection type (private link or public)<br />\ng. Authentication mode<br />\nh. Connection name for the configuration</li>\n</ol>\n<p><img src=\"https://dev-media.amazoncloud.cn/e91625935c664aa1ac39d50260c471d9_image.png\" alt=\"image.png\" /></p>\n<ol start=\"2\">\n<li>After you configure the source, choose the OData object and subobject for the sales orders.<br />\nGenerally, sales data from SAP is exported in full at a certain frequency, such as monthly or quarterly. 
For this post, choose the subobject option for the full-size export.</li>\n</ol>\n<p><img src=\"https://dev-media.amazoncloud.cn/7e6ecb158b0a4238a7c22fcb0e5c8276_image.png\" alt=\"image.png\" /></p>\n<ol start=\"3\">\n<li>Choose the S3 bucket as the destination.<br />\nThe flow exports data to this bucket.</li>\n</ol>\n<p><img src=\"https://dev-media.amazoncloud.cn/68fdba9feb9c4cfaa3828a80f8cd0bff_image.png\" alt=\"image.png\" /></p>\n<ol start=\"4\">\n<li>For <strong>Data format preference</strong>, select <strong>CSV format</strong>.</li>\n<li>For <strong>Data transfer preference</strong>, select <strong>Aggregate all records</strong>.</li>\n<li>For <strong>Filename preference</strong>, select <strong>Add a timestamp to the file name</strong>.</li>\n<li>For <strong>Folder structure preference</strong>, select <strong>No timestamped folder</strong>.<br />\nThe <strong>record aggregation configuration</strong> exports the full-size sales data from SAP combined in a single file. The file is written to a single folder (named after the flow) within the S3 bucket, and its name ends with a timestamp in the YYYY-MM-DDTHH:mm:ss format. 
SageMaker Canvas imports data from this single file for model training and forecasting.</li>\n</ol>\n<p><img src=\"https://dev-media.amazoncloud.cn/2301826da2b3433a86df6756cf87faf3_image.png\" alt=\"image.png\" /></p>\n<ol start=\"8\">\n<li>Configure data mapping and validations to map the source data fields to destination data fields, and enable data validation rules as required.</li>\n</ol>\n<p><img src=\"https://dev-media.amazoncloud.cn/3ec7a96865b6459d94c30afb091248d5_image.png\" alt=\"image.png\" /></p>\n<ol start=\"9\">\n<li>You can also configure data filtering conditions to filter out specific records if your requirements demand it.</li>\n</ol>\n<p><img src=\"https://dev-media.amazoncloud.cn/a206d7f27992436a9eae8155e4090deb_image.png\" alt=\"image.png\" /></p>\n<ol start=\"10\">\n<li>Configure your flow trigger to decide whether the flow runs manually on demand or automatically based on a schedule.<br />\nWhen configured for a schedule, the frequency is based on how frequently the forecast needs to be generated (generally monthly, quarterly, or half-yearly).</li>\n</ol>\n<p><img src=\"https://dev-media.amazoncloud.cn/8056730e8115429f96f06ac61ccde0a9_image.png\" alt=\"image.png\" /></p>\n<p>After the flow is configured, the business analysts can run it on demand or based on the schedule to perform an ETL job on the sales order data from SAP to an S3 bucket.<br />\n11. In addition to the Amazon AppFlow configuration, the data engineers also need to configure an <a href=\"http://aws.amazon.com/iam\" target=\"_blank\">AWS Identity and Access Management</a> (IAM) role for SageMaker Canvas so that it can access other AWS services. 
For instructions, refer to <a href=\"https://docs.aws.amazon.com/sagemaker/latest/dg/canvas-set-up-forecast.html\" target=\"_blank\">Give your users permissions to perform time series forecasting</a>.</p>\n<h4><a id=\"Business_analyst_Use_the_historical_sales_data_to_train_a_forecasting_model_128\"></a><strong>Business analyst: Use the historical sales data to train a forecasting model</strong></h4>\n<p>Let’s switch gears and move to the business analyst side. As a business analyst, you’re looking for a visual, point-and-click service that makes it easy to build ML models and generate accurate predictions without writing a single line of code or having ML expertise. SageMaker Canvas fits the requirement as a no-code ML solution.</p>\n<p>First, make sure that your IAM role is configured in such a way that SageMaker Canvas can access other AWS services. For more information, refer to <a href=\"https://docs.aws.amazon.com/en_jp/sagemaker/latest/dg/canvas-set-up-forecast.html\" target=\"_blank\">Give your users permissions to perform time series forecasting</a>, or ask your Cloud Engineering team for help.</p>\n<p>When the data engineer is done setting up the Amazon AppFlow-based ETL configuration, the historical sales data is available for you in an S3 bucket.</p>\n<p><img src=\"https://dev-media.amazoncloud.cn/aad3f3796bf1490dba548d5193ac6aed_image.png\" alt=\"image.png\" /></p>\n<p>You’re now ready to train a model with SageMaker Canvas! This typically involves four steps: importing data into the service, configuring the model training by selecting the appropriate model type, training the model, and finally generating forecasts using the model.</p>\n<h4><a id=\"Import_data_in_SageMaker_Canvas_140\"></a><strong>Import data in SageMaker Canvas</strong></h4>\n<p>First, launch the SageMaker Canvas app from the <a href=\"https://aws.amazon.com/sagemaker/\" target=\"_blank\">Amazon SageMaker</a> console or from your single sign-on access. 
If you don’t know how to do that, contact your administrator so that they can guide you through the process of setting up SageMaker Canvas. Make sure that you access the service in the same Region as the S3 bucket containing the historical dataset from SAP. You should see a screen like the following.</p>\n<p><img src=\"https://dev-media.amazoncloud.cn/1d861bad474c49d584c4e4f68a942e36_image.png\" alt=\"image.png\" /></p>\n<p>Then complete the following steps:</p>\n<ol>\n<li>In SageMaker Canvas, choose <strong>Datasets</strong> in the navigation pane.</li>\n<li>Choose <strong>Import</strong> to start importing data from the S3 bucket.</li>\n</ol>\n<p><img src=\"https://dev-media.amazoncloud.cn/57f6d011faf542028253038fdaee2583_image.png\" alt=\"image.png\" /></p>\n<ol start=\"3\">\n<li>On the import screen, choose the data file or object from the S3 bucket to import the training data.</li>\n</ol>\n<p><img src=\"https://dev-media.amazoncloud.cn/1313b7cdc2d343c3a723053a08079d87_image.png\" alt=\"image.png\" /></p>\n<p>You can import multiple datasets in SageMaker Canvas. 
It also supports creating joins between the datasets by choosing <strong>Join data</strong>, which is particularly useful when the training data is spread across multiple files.</p>\n<p><strong>Configure and train the model</strong></p>\n<p>After you import the data, complete the following steps:</p>\n<ol>\n<li>Choose <strong>Models</strong> in the navigation pane.</li>\n<li>Choose <strong>New model</strong> to start configuring the training of the forecast model.</li>\n</ol>\n<p><img src=\"https://dev-media.amazoncloud.cn/d5cc89ded78349399fd639c2ade4c7b4_image.png\" alt=\"image.png\" /></p>\n<ol start=\"3\">\n<li>Give the new model a suitable name, such as <code>product_sales_forecast_model</code>.</li>\n<li>Select the sales dataset and choose <strong>Select dataset</strong>.</li>\n</ol>\n<p><img src=\"https://dev-media.amazoncloud.cn/3f53ffba7eef47d3aae49f3e0c8eb34e_image.png\" alt=\"image.png\" /></p>\n<p>After the dataset is selected, you can see data statistics and configure the model training on the <strong>Build</strong> tab.</p>\n<p><img src=\"https://dev-media.amazoncloud.cn/c1fd350d83d54500b26b0f31f52c3e7e_image.png\" alt=\"image.png\" /></p>\n<ol start=\"5\">\n<li>Select <strong>totalsales</strong> as the target column for the prediction.<br />\nYou can see that <strong>Time series forecasting</strong> is automatically selected as the model type.</li>\n<li>Choose <strong>Configure</strong>.</li>\n</ol>\n<p><img src=\"https://dev-media.amazoncloud.cn/2027beb710154f89b2a4f93f4cac59b3_image.png\" alt=\"image.png\" /></p>\n<ol start=\"7\">\n<li>In the <strong>Time series forecasting</strong> configuration section, choose <strong>productid</strong> for <strong>Item ID column</strong>.</li>\n<li>Choose <strong>city</strong> for <strong>Group column</strong>.</li>\n<li>Choose <strong>saledate</strong> for <strong>Time stamp column</strong>.</li>\n<li>For <strong>Days</strong>, enter <code>120</code>.</li>\n<li>Choose <strong>Save</strong>.<br />\nThis configures the model to 
make forecasts for <code>totalsales</code> for 120 days, using <code>saledate</code> as the time stamp and based on historical data; the forecasts can be queried by <code>productid</code> and <code>city</code>.</li>\n</ol>\n<p><img src=\"https://dev-media.amazoncloud.cn/0c86b0fa08ef45ada431d528903ea637_image.png\" alt=\"image.png\" /></p>\n<ol start=\"12\">\n<li>When the model training configuration is complete, choose <strong>Standard Build</strong> to start the model training.</li>\n</ol>\n<p>The <strong>Preview model</strong> option is not available for the time series forecasting model type. You can review the estimated time for the model training on the <strong>Analyze</strong> tab.</p>\n<p><img src=\"https://dev-media.amazoncloud.cn/632c7dd3291947afbfdf531a9d8337fc_image.png\" alt=\"image.png\" /></p>\n<p>Model training might take 1–4 hours to complete, depending on the data size. When the model is ready, you can use it to generate the forecast.</p>\n<h4><a id=\"Generate_a_forecast_201\"></a><strong>Generate a forecast</strong></h4>\n<p>When the model training is complete, SageMaker Canvas shows the prediction accuracy of the model on the <strong>Analyze</strong> tab. In this example, the prediction accuracy is 92.87%.</p>\n<p><img src=\"https://dev-media.amazoncloud.cn/84920d136dd9453aa8acdbefc9c100f4_image.png\" alt=\"image.png\" /></p>\n<p>The forecast is generated on the <strong>Predict</strong> tab. You can generate forecasts for all the items or for a single selected item. The tab also shows the date range for which the forecast can be generated.</p>\n<p><img src=\"https://dev-media.amazoncloud.cn/4cc3cd7aa08f4c82a8536202e488db08_image.png\" alt=\"image.png\" /></p>\n<p>As an example, choose the <strong>Single item</strong> option. 
Select <strong>P-2</strong> for <strong>Item</strong> and <strong>Quito</strong> for <strong>Group</strong> to generate a prediction for product P-2 in the city of Quito for the date range 2017-08-15 00:00:00 through 2017-12-13 00:00:00.</p>\n<p><img src=\"https://dev-media.amazoncloud.cn/e3f0742b74904dcd97036761d0812e31_image.png\" alt=\"image.png\" /></p>\n<p>The generated forecast shows the average forecast as well as the upper and lower bounds of the forecast. The forecast bounds help you choose between an aggressive and a balanced approach when acting on the forecast.</p>\n<p>You can also download the generated forecast as a CSV file or image. The generated forecast CSV file is generally used to work offline with the forecast data.</p>\n<p><img src=\"https://dev-media.amazoncloud.cn/94816028493943a7bbda0857733557c6_image.png\" alt=\"image.png\" /></p>\n<p>The forecast is now generated for the time series data. When a new baseline of data becomes available for the forecast, you can change the dataset in SageMaker Canvas to retrain the forecast model using the new baseline.</p>\n<p><img src=\"https://dev-media.amazoncloud.cn/77cea97dd194441596a7863a02010043_image.png\" alt=\"image.png\" /></p>\n<p>You can retrain the model whenever the training data changes.</p>\n<h4><a id=\"Clean_up_227\"></a><strong>Clean up</strong></h4>\n<p>To avoid incurring future <a href=\"https://aws.amazon.com/sagemaker/canvas/pricing\" target=\"_blank\">session charges</a>, log out of SageMaker Canvas.</p>\n<p><img src=\"https://dev-media.amazoncloud.cn/26bfea6caa7e47918c7e16307d2f9d3c_image.png\" alt=\"image.png\" /></p>\n<h4><a id=\"Conclusion_233\"></a><strong>Conclusion</strong></h4>\n<p>In this post, you learned how the Amazon AppFlow SAP OData Connector exports sales order data from the SAP system into an S3 bucket and then how to use SageMaker Canvas to build a model for forecasting.</p>\n<p>You can use SageMaker Canvas for any SAP time series data scenario, such as expense or 
revenue prediction. The entire forecast generation process is configuration driven. Sales managers and representatives can generate sales forecasts every month or every quarter with a refreshed set of data in a fast, straightforward, and intuitive way without writing a single line of code. This helps improve productivity and enables quick planning and decisions.</p>\n<p>To get started, learn more about SageMaker Canvas and Amazon AppFlow using the following resources:</p>\n<ul>\n<li><a href=\"https://docs.aws.amazon.com/sagemaker/latest/dg/canvas.html\" target=\"_blank\">Amazon SageMaker Canvas Developer Guide</a></li>\n<li><a href=\"https://aws.amazon.com/blogs/aws/announcing-amazon-sagemaker-canvas-a-visual-no-code-machine-learning-capability-for-business-analysts/\" target=\"_blank\">Announcing Amazon SageMaker Canvas – a Visual, No Code Machine Learning Capability for Business Analysts</a></li>\n<li><a href=\"https://aws.amazon.com/blogs/awsforsap/extract-data-from-sap-erp-and-bw-with-amazon-appflow/\" target=\"_blank\">Extract data from SAP ERP and BW with Amazon AppFlow</a></li>\n<li><a href=\"https://docs.aws.amazon.com/appflow/latest/userguide/sapodata.html\" target=\"_blank\">SAP OData Connector configuration</a></li>\n</ul>\n<h4><a id=\"About_the_Authors_246\"></a><strong>About the Authors</strong></h4>\n<p><img src=\"https://dev-media.amazoncloud.cn/2f6c7654aa474d518319843117e58242_image.png\" alt=\"image.png\" /></p>\n<p><strong>Brajendra Singh</strong> is a solution architect at Amazon Web Services working with enterprise customers. He has a strong developer background and is a keen enthusiast for data and machine learning solutions.</p>\n<p><img src=\"https://dev-media.amazoncloud.cn/11acf9b344da4afc98b3d782f3501374_image.png\" alt=\"image.png\" /></p>\n<p><strong>Davide Gallitelli</strong> is a Specialist Solutions Architect for AI/ML in the EMEA region. He is based in Brussels and works closely with customers throughout Benelux. 
He has been a developer since he was very young, starting to code at the age of 7. He started learning AI/ML at university and has been in love with it ever since.</p>\n"}