Ingest Stripe data in a fast and reliable way using Stripe Data Pipeline for Amazon Redshift

海外精选
海外精选的内容汇集了全球优质的亚马逊云科技相关技术内容。同时,内容中提到的“AWS” 是 “Amazon Web Services” 的缩写,在此网站不作为商标展示。
0
0
{"value":"<p>Enterprises typically host a myriad of business applications for varying data needs. As companies grow, so does the demand for insights from a complete set of business data. Having data from various applications that store data in disparate silos can delay the decision-making process. However, building and maintaining an API integration or a third-party extract, transform, and load (ETL) pipeline to move data into a destination data store can be time-consuming and expensive.</p>\n\n<p>Today we’re delighted to introduce Stripe Data Pipeline for Amazon Redshift to help you access your Stripe data and extract insight securely and easily from <a href=\\"https://aws.amazon.com/redshift/\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">Amazon Redshift</a>. This data, including billing, issuing, and payment records, can be shared in a consistent and automated fashion. You can integrate your Stripe data with data from other sources in your [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) clusters to create a single source of truth.</p>\\n\\n<p>In this post, we discuss the benefits of Stripe Data Pipeline and some of its use cases.</p>\n\n### **Solution overview**\n\n<p>Amazon Redshift is a fast, fully managed, petabyte-scale cloud data warehousing service that makes it simple and cost-effective to efficiently analyze all your data using your existing business intelligence (BI) tools. It’s optimized for datasets ranging from a few hundred gigabytes to petabytes or more. This columnar data warehouse provides provisioned as well as serverless deployment options and uses an industry-standard SQL interface to analyze structured and semi-structured data with fast query performance.</p>\n\n<p>Stripe Data Pipeline is powered by Amazon Redshift’s latest <a href=\\"https://aws.amazon.com/redshift/features/ra3/\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">RA3 instances</a>, which provide <a href=\\"https://aws.amazon.com/redshift/features/data-sharing/\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">cross-account data sharing</a> capability. RA3 takes a performant, cost-effective approach to address rapidly growing data volume by decoupling data processing from managed storage. You can then scale compute and storage independently and only pay for what you use. Data sharing provides read access directly to data stored across [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) clusters without data movement. This capability removes the complexity and delays that are often associated with managing large distributed datasets across multiple accounts.</p>\\n\\n<p>The solution provides the following core features and benefits:</p>\n\n<ul> \\n <li><strong>Scalable and managed data pipeline</strong> – You don’t need to build, maintain, and scale custom ETL jobs. You can set up Stripe Data Pipeline in minutes, and it and scales automatically to handle increased business activities and data volume.</li> \\n <li><strong>Up-to-date financial data </strong>– You automatically receive and refresh a complete set of your Stripe data and reports in [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) on a low-latency schedule. Stripe Data Pipeline is built into Stripe and always provides accurate data.</li> \\n <li><strong>Security and compliance</strong> – Data is shared directly from Stripe with your [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) cluster, and confidentiality of the data is protected in transit and at rest. [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) offers comprehensive security controls and monitoring via native integration with <a href=\\"http://aws.amazon.com/cloudtrail\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">AWS CloudTrail</a> and <a href=\\"http://aws.amazon.com/cloudwatch\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">Amazon CloudWatch</a> (for more information, see <a href=\\"https://docs.aws.amazon.com/redshift/latest/mgmt/db-auditing.html#rs-db-auditing-cloud-trail\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">Logging Amazon Redshift API calls with AWS CloudTrail</a> and <a href=\\"https://docs.aws.amazon.com/redshift/latest/mgmt/metrics-listing.html\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">Monitoring Amazon Redshift using CloudWatch metrics</a>, respectively). You can define and audit who has access to what and ensure the compliance requirements are met.</li> \\n <li><strong>Extensibility</strong> – Once the data is accessible in AWS, you benefit from the breadth of native integrations [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) supports. You can join datasets from other data stores in operational databases, build reports and dashboards with BI tools, or identify patterns and generate prediction using <a href=\\"https://aws.amazon.com/redshift/features/redshift-ml/\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">Amazon Redshift ML</a>.</li> \\n </ul>\n\n<p>The following architecture diagram provides a quick overview of how data sharing works and how other AWS services can be used together. We dive deeper into different use cases in the following sections.</p>\n\n![image.png](https://dev-media.amazoncloud.cn/13bee3843bc342b5a2691b5e595d99e1_image.png)\n\n### **Accept datashares from Stripe**\n\n<p>You can configure the solution in a few steps with no code necessary.</p>\n\n<p>Once Stripe creates a datashare from the producer cluster and authorizes your AWS account, you can view this datashare on your Amazon Redshift console. You need to associate it with specific or all clusters in your AWS account as the consumer. Clusters can be specified by namespaces as globally unique identifiers. Next, you create a database from the datashare in order to start querying data.</p>\n\n### **Query data from the consumer [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) cluster**\n\n<p>You can now access your Stripe data and schema directly from Amazon Redshift’s web-based query editor. This direct connection enables teams to pull accurate analysis of various functions of the business. For example:</p>\n\n<ul> \\n <li><strong>Finance</strong> – “How does my cash flow change based on seasonality?”</li> \\n <li><strong>Sales</strong> – “How many customers do we have in the US?”</li> \\n <li><strong>Product</strong> – “How many active users do we have on each subscription plan?”</li> \\n <li><strong>Sales operations</strong> – “Which customers haven’t paid their invoices?”</li> \\n </ul>\n\n<p>The following screenshot shows an example in which the query editor displays the number of charges blocked per Stripe’s connected account.</p>\n\n![image.png](https://dev-media.amazoncloud.cn/d8cdfd4d79f64a7a87a208cefaacbfd1_image.png)\n\n### **Use federated queries**\n\n<p>The <a href=\\"https://aws.amazon.com/redshift/modern-data-architecture/\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">modern data architecture</a> of [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) enables you to store data in purpose-built data stores based on specific use cases, and allows querying external databases on <a href=\\"http://aws.amazon.com/rds\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">Amazon Relational Database Service</a> ([Amazon RDS](https://aws.amazon.com/cn/rds/?trk=cndc-detail)) or datasets in an <a href=\\"http://aws.amazon.com/s3\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">Amazon Simple Storage Service</a> ([Amazon S3](https://aws.amazon.com/cn/s3/?trk=cndc-detail)) data lake without moving these datasets to [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) clusters. You can drive deeper by incorporating data from [Amazon RDS](https://aws.amazon.com/cn/rds/?trk=cndc-detail), or from an S3 data lake through <a href=\\"https://docs.aws.amazon.com/redshift/latest/dg/c-getting-started-using-spectrum.html\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">Amazon Redshift Spectrum</a>. This capability provides a native integration without requiring additional ETL jobs.</p>\\n\\n<p>The following syntax allows you to create an external schema from an <a href=\\"https://aws.amazon.com/rds/aurora/mysql-features/\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">Amazon Aurora MySQL-Compatible Edition</a> database to an [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) cluster. [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) assumes an <a href=\\"http://aws.amazon.com/iam\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">AWS Identity and Access Management</a> (IAM) role and uses <a href=\\"https://aws.amazon.com/secrets-manager/\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">AWS Secrets Manager</a> to access external data stores. For more information and examples with other supported data stores, refer to <a href=\\"https://docs.aws.amazon.com/redshift/latest/dg/federated-overview.html\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">Querying data with federated queries in Amazon Redshift</a>.</p>\\n\\n<div class=\\"hide-language\\"> \\n <div class=\\"code-toolbar\\"><pre class=\\" language-sql\\"><code class=\\" language-sql\\"><span class=\\"token keyword\\">CREATE</span> EXTERNAL <span class=\\"token keyword\\">SCHEMA</span> auroram\n<span class=\\"token keyword\\">FROM</span> MYSQL\n<span class=\\"token keyword\\">DATABASE</span> ‘example_database’ <span class=\\"token keyword\\">SCHEMA</span> <span class=\\"token string\\">'example_schema'</span> <span class=\\"token comment\\">-- schema is optional</span>\nURI ‘hostname’\nIAM_ROLE ‘iam_role_arn’\nSECRET_ARN ‘aws_secrets_manager_arn’<span class=\\"token punctuation\\">;</span> </code></pre><div class=\\"toolbar\\"><div class=\\"toolbar-item\\"><span></span></div></div></div> \\n </div>\n\n<p>Coming back to Stripe Data Pipeline, now you can combine the data from an Aurora table and create further analysis. For example, you can correlate the trends of customer acquisition against sales campaign by region, so you can gain an understanding of the campaign effectiveness and make adjustment to marketing strategies.</p>\n\n### **Create visualizations and dashboards**\n\n<p>Now that your complete set of business data is accessible from Amazon Redshift, you can start to explore the data and create visualizations. <a href=\\"https://aws.amazon.com/quicksight/\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">Amazon QuickSight</a> is a serverless BI service that allows you to easily connect to a data source, create analyses, publish dashboards, and share between teams. QuickSight seamlessly integrates with AWS services such as [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail), [Amazon S3](https://aws.amazon.com/cn/s3/?trk=cndc-detail), and many more.</p>\\n\\n<p>The following screenshot illustrates how straightforward it is to connect an Amazon Redshift instance to QuickSight as a new data source.</p>\n\n![image.png](https://dev-media.amazoncloud.cn/7401aad9c698491aa152aa16dda54261_image.png)\n\n<p>The following screenshot is of a sample QuickSight dashboard pulling data from Amazon Redshift.</p>\n\n![image.png](https://dev-media.amazoncloud.cn/a2e8753d181d4a38bc28f45696624698_image.png)\n\n### **Key considerations**\n\n<p>When using Stripe Data Pipeline, consider the following:</p>\n\n<ul> \\n <li><strong>Instance type</strong> – This solution is available for all RA3 node types. If you run an existing DS2 or DC2 cluster, there are multiple options to migrate to RA3, including elastic resize, snapshot and restore, and classic resize. For more information, including an upgrade sizing reference between different node types, refer to <a href=\\"https://docs.aws.amazon.com/redshift/latest/mgmt/working-with-clusters.html#rs-upgrading-to-ra3\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">Upgrading to RA3 node types</a>.</li> \\n <li><strong>RI migration </strong>– If you have [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) Reserved Instances (RIs), you can use the <a href=\\"https://aws.amazon.com/about-aws/whats-new/2021/11/amazon-redshift-ra3-reserved-instance-migration/\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">RI migration feature </a>to migrate the DS2 RI clusters to equivalent RA3 RI clusters as part of a cross-instance resize or cross-instance snapshot restore operation. The RA3 RI covering the new cluster will be the same cost and on the same calendar terms as the original DS2 RI for supported configurations.</li> \\n <li><strong>Encryption</strong> – The consumer cluster must be <a href=\\"https://docs.aws.amazon.com/redshift/latest/mgmt/working-with-db-encryption.html\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">encrypted</a> as part of the enhanced security control for cross-account sharing. You can enable encryption at cluster creation time, or modify an unencrypted cluster with either <a href=\\"https://aws.amazon.com/kms/\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">AWS Key Management Service</a> (AWS KMS) or <a href=\\"https://aws.amazon.com/cloudhsm\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">AWS CloudHSM</a>.</li> \\n <li><strong>Federated queries </strong>– This capability works with external DB instances, including <a href=\\"https://aws.amazon.com/rds/postgresql/\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">Amazon RDS for PostgreSQL</a>, <a href=\\"https://aws.amazon.com/rds/aurora/postgresql-features/\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">Amazon Aurora PostgreSQL-Compatible Edition</a>, <a href=\\"https://aws.amazon.com/rds/mysql/\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">Amazon RDS for MySQL</a>, and&nbsp;<a class=\\"c-link\\" href=\\"https://aws.amazon.com/rds/aurora/faqs/\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\" data-stringify-link=\\"https://aws.amazon.com/rds/aurora/faqs/\\" data-sk=\\"tooltip_parent\\" aria-describedby=\\"sk-tooltip-5428\\">Aurora MySQL-Compatible Edition</a>. You should also ensure that you have an [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) cluster with a cluster maintenance version that supports federated queries.</li> \\n </ul>\n\n### **Conclusion**\n\n<p>In this post, we introduced Stripe Data Pipeline for Amazon Redshift and discussed options to further integrate with AWS services. Stripe Data Pipeline removes the need to build custom API integration or adopt a third-party ETL pipeline, making data accessible with a few clicks and with no code required. Businesses can automatically receive up-to-date data from Stripe in their data warehouse on AWS, reduce data silos, and extract deep insights to address business needs.</p>\n\n<p>Check out <a href=\\"http://stripe.com/data-pipeline\\" target=\\"_blank\\" rel=\\"noopener noreferrer\\">Stripe Data Pipeline</a> for more information about the solution and how to get started.</p>\\n\\n#### **About the Authors**\\n\\n![image.png](https://dev-media.amazoncloud.cn/eac04f27b66346e5b263ad9f8f4d883b_image.png)\\n\\n<p style=\\"clear: both\\"><strong>Jessica Ho</strong> is a Sr. Partner Solutions Architect at AWS supporting ISV partners who build business applications. She is passionate about creating differentiated solutions that promote cloud adoption. Outside of work, she enjoys spoiling her garden into a mini jungle.</p>\\n\\n![image.png](https://dev-media.amazoncloud.cn/e142743d70f64899816a531a28a4979e_image.png)\\n\\n<p style=\\"clear: both\\"><strong>Alexander Mahabir</strong>&nbsp;is a Sr. Partner Solutions Architect at AWS based in the D.C metropolitan area. Alex has over 16 year of experience building cloud, and on-premise solutions for small, medium, and large enterprises. Alex currently works with ISV partners in the Digital Customer Experience segment.</p>","render":"<p>Enterprises typically host a myriad of business applications for varying data needs. As companies grow, so does the demand for insights from a complete set of business data. Having data from various applications that store data in disparate silos can delay the decision-making process. However, building and maintaining an API integration or a third-party extract, transform, and load (ETL) pipeline to move data into a destination data store can be time-consuming and expensive.</p>\n<p>Today we’re delighted to introduce Stripe Data Pipeline for Amazon Redshift to help you access your Stripe data and extract insight securely and easily from <a href=\\"https://aws.amazon.com/redshift/\\" target=\\"_blank\\">Amazon Redshift</a>. This data, including billing, issuing, and payment records, can be shared in a consistent and automated fashion. You can integrate your Stripe data with data from other sources in your [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) clusters to create a single source of truth.</p>\\n<p>In this post, we discuss the benefits of Stripe Data Pipeline and some of its use cases.</p>\n<h3><a id=\\"Solution_overview_6\\"></a><strong>Solution overview</strong></h3>\\n<p>Amazon Redshift is a fast, fully managed, petabyte-scale cloud data warehousing service that makes it simple and cost-effective to efficiently analyze all your data using your existing business intelligence (BI) tools. It’s optimized for datasets ranging from a few hundred gigabytes to petabytes or more. This columnar data warehouse provides provisioned as well as serverless deployment options and uses an industry-standard SQL interface to analyze structured and semi-structured data with fast query performance.</p>\n<p>Stripe Data Pipeline is powered by Amazon Redshift’s latest <a href=\\"https://aws.amazon.com/redshift/features/ra3/\\" target=\\"_blank\\">RA3 instances</a>, which provide <a href=\\"https://aws.amazon.com/redshift/features/data-sharing/\\" target=\\"_blank\\">cross-account data sharing</a> capability. RA3 takes a performant, cost-effective approach to address rapidly growing data volume by decoupling data processing from managed storage. You can then scale compute and storage independently and only pay for what you use. Data sharing provides read access directly to data stored across [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) clusters without data movement. This capability removes the complexity and delays that are often associated with managing large distributed datasets across multiple accounts.</p>\\n<p>The solution provides the following core features and benefits:</p>\n<ul> \\n <li><strong>Scalable and managed data pipeline</strong> – You don’t need to build, maintain, and scale custom ETL jobs. You can set up Stripe Data Pipeline in minutes, and it and scales automatically to handle increased business activities and data volume.</li> \\n <li><strong>Up-to-date financial data </strong>– You automatically receive and refresh a complete set of your Stripe data and reports in [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) on a low-latency schedule. Stripe Data Pipeline is built into Stripe and always provides accurate data.</li> \\n <li><strong>Security and compliance</strong> – Data is shared directly from Stripe with your [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) cluster, and confidentiality of the data is protected in transit and at rest. [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) offers comprehensive security controls and monitoring via native integration with <a href=\\"http://aws.amazon.com/cloudtrail\\" target=\\"_blank\\">AWS CloudTrail</a> and <a href=\\"http://aws.amazon.com/cloudwatch\\" target=\\"_blank\\">Amazon CloudWatch</a> (for more information, see <a href=\\"https://docs.aws.amazon.com/redshift/latest/mgmt/db-auditing.html#rs-db-auditing-cloud-trail\\" target=\\"_blank\\">Logging Amazon Redshift API calls with AWS CloudTrail</a> and <a href=\\"https://docs.aws.amazon.com/redshift/latest/mgmt/metrics-listing.html\\" target=\\"_blank\\">Monitoring Amazon Redshift using CloudWatch metrics</a>, respectively). You can define and audit who has access to what and ensure the compliance requirements are met.</li> \\n <li><strong>Extensibility</strong> – Once the data is accessible in AWS, you benefit from the breadth of native integrations [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) supports. You can join datasets from other data stores in operational databases, build reports and dashboards with BI tools, or identify patterns and generate prediction using <a href=\\"https://aws.amazon.com/redshift/features/redshift-ml/\\" target=\\"_blank\\">Amazon Redshift ML</a>.</li> \\n </ul>\n<p>The following architecture diagram provides a quick overview of how data sharing works and how other AWS services can be used together. We dive deeper into different use cases in the following sections.</p>\n<p><img src=\\"https://dev-media.amazoncloud.cn/13bee3843bc342b5a2691b5e595d99e1_image.png\\" alt=\\"image.png\\" /></p>\n<h3><a id=\\"Accept_datashares_from_Stripe_25\\"></a><strong>Accept datashares from Stripe</strong></h3>\\n<p>You can configure the solution in a few steps with no code necessary.</p>\n<p>Once Stripe creates a datashare from the producer cluster and authorizes your AWS account, you can view this datashare on your Amazon Redshift console. You need to associate it with specific or all clusters in your AWS account as the consumer. Clusters can be specified by namespaces as globally unique identifiers. Next, you create a database from the datashare in order to start querying data.</p>\n<h3><a id=\\"Query_data_from_the_consumer_Amazon_Redshift_cluster_31\\"></a><strong>Query data from the consumer Amazon Redshift cluster</strong></h3>\\n<p>You can now access your Stripe data and schema directly from Amazon Redshift’s web-based query editor. This direct connection enables teams to pull accurate analysis of various functions of the business. For example:</p>\n<ul> \\n <li><strong>Finance</strong> – “How does my cash flow change based on seasonality?”</li> \\n <li><strong>Sales</strong> – “How many customers do we have in the US?”</li> \\n <li><strong>Product</strong> – “How many active users do we have on each subscription plan?”</li> \\n <li><strong>Sales operations</strong> – “Which customers haven’t paid their invoices?”</li> \\n </ul>\n<p>The following screenshot shows an example in which the query editor displays the number of charges blocked per Stripe’s connected account.</p>\n<p><img src=\\"https://dev-media.amazoncloud.cn/d8cdfd4d79f64a7a87a208cefaacbfd1_image.png\\" alt=\\"image.png\\" /></p>\n<h3><a id=\\"Use_federated_queries_46\\"></a><strong>Use federated queries</strong></h3>\\n<p>The <a href=\\"https://aws.amazon.com/redshift/modern-data-architecture/\\" target=\\"_blank\\">modern data architecture</a> of [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) enables you to store data in purpose-built data stores based on specific use cases, and allows querying external databases on <a href=\\"http://aws.amazon.com/rds\\" target=\\"_blank\\">Amazon Relational Database Service</a> ([Amazon RDS](https://aws.amazon.com/cn/rds/?trk=cndc-detail)) or datasets in an <a href=\\"http://aws.amazon.com/s3\\" target=\\"_blank\\">Amazon Simple Storage Service</a> ([Amazon S3](https://aws.amazon.com/cn/s3/?trk=cndc-detail)) data lake without moving these datasets to [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) clusters. You can drive deeper by incorporating data from [Amazon RDS](https://aws.amazon.com/cn/rds/?trk=cndc-detail), or from an S3 data lake through <a href=\\"https://docs.aws.amazon.com/redshift/latest/dg/c-getting-started-using-spectrum.html\\" target=\\"_blank\\">Amazon Redshift Spectrum</a>. This capability provides a native integration without requiring additional ETL jobs.</p>\\n<p>The following syntax allows you to create an external schema from an <a href=\\"https://aws.amazon.com/rds/aurora/mysql-features/\\" target=\\"_blank\\">Amazon Aurora MySQL-Compatible Edition</a> database to an [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) cluster. [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) assumes an <a href=\\"http://aws.amazon.com/iam\\" target=\\"_blank\\">AWS Identity and Access Management</a> (IAM) role and uses <a href=\\"https://aws.amazon.com/secrets-manager/\\" target=\\"_blank\\">AWS Secrets Manager</a> to access external data stores. For more information and examples with other supported data stores, refer to <a href=\\"https://docs.aws.amazon.com/redshift/latest/dg/federated-overview.html\\" target=\\"_blank\\">Querying data with federated queries in Amazon Redshift</a>.</p>\\n<div> \\n <div><pre><code><span>CREATE</span> EXTERNAL <span>SCHEMA</span> auroram\n<span>FROM</span> MYSQL\n<span>DATABASE</span> ‘example_database’ <span>SCHEMA</span> <span>'example_schema'</span> <span>-- schema is optional</span>\nURI ‘hostname’\nIAM_ROLE ‘iam_role_arn’\nSECRET_ARN ‘aws_secrets_manager_arn’<span>;</span> </code></pre><div><div><span></span></div></div></div> \\n </div>\n<p>Coming back to Stripe Data Pipeline, now you can combine the data from an Aurora table and create further analysis. For example, you can correlate the trends of customer acquisition against sales campaign by region, so you can gain an understanding of the campaign effectiveness and make adjustment to marketing strategies.</p>\n<h3><a id=\\"Create_visualizations_and_dashboards_63\\"></a><strong>Create visualizations and dashboards</strong></h3>\\n<p>Now that your complete set of business data is accessible from Amazon Redshift, you can start to explore the data and create visualizations. <a href=\\"https://aws.amazon.com/quicksight/\\" target=\\"_blank\\">Amazon QuickSight</a> is a serverless BI service that allows you to easily connect to a data source, create analyses, publish dashboards, and share between teams. QuickSight seamlessly integrates with AWS services such as [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail), [Amazon S3](https://aws.amazon.com/cn/s3/?trk=cndc-detail), and many more.</p>\\n<p>The following screenshot illustrates how straightforward it is to connect an Amazon Redshift instance to QuickSight as a new data source.</p>\n<p><img src=\\"https://dev-media.amazoncloud.cn/7401aad9c698491aa152aa16dda54261_image.png\\" alt=\\"image.png\\" /></p>\n<p>The following screenshot is of a sample QuickSight dashboard pulling data from Amazon Redshift.</p>\n<p><img src=\\"https://dev-media.amazoncloud.cn/a2e8753d181d4a38bc28f45696624698_image.png\\" alt=\\"image.png\\" /></p>\n<h3><a id=\\"Key_considerations_75\\"></a><strong>Key considerations</strong></h3>\\n<p>When using Stripe Data Pipeline, consider the following:</p>\n<ul> \\n <li><strong>Instance type</strong> – This solution is available for all RA3 node types. If you run an existing DS2 or DC2 cluster, there are multiple options to migrate to RA3, including elastic resize, snapshot and restore, and classic resize. For more information, including an upgrade sizing reference between different node types, refer to <a href=\\"https://docs.aws.amazon.com/redshift/latest/mgmt/working-with-clusters.html#rs-upgrading-to-ra3\\" target=\\"_blank\\">Upgrading to RA3 node types</a>.</li> \\n <li><strong>RI migration </strong>– If you have [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) Reserved Instances (RIs), you can use the <a href=\\"https://aws.amazon.com/about-aws/whats-new/2021/11/amazon-redshift-ra3-reserved-instance-migration/\\" target=\\"_blank\\">RI migration feature </a>to migrate the DS2 RI clusters to equivalent RA3 RI clusters as part of a cross-instance resize or cross-instance snapshot restore operation. The RA3 RI covering the new cluster will be the same cost and on the same calendar terms as the original DS2 RI for supported configurations.</li> \\n <li><strong>Encryption</strong> – The consumer cluster must be <a href=\\"https://docs.aws.amazon.com/redshift/latest/mgmt/working-with-db-encryption.html\\" target=\\"_blank\\">encrypted</a> as part of the enhanced security control for cross-account sharing. You can enable encryption at cluster creation time, or modify an unencrypted cluster with either <a href=\\"https://aws.amazon.com/kms/\\" target=\\"_blank\\">AWS Key Management Service</a> (AWS KMS) or <a href=\\"https://aws.amazon.com/cloudhsm\\" target=\\"_blank\\">AWS CloudHSM</a>.</li> \\n <li><strong>Federated queries </strong>– This capability works with external DB instances, including <a href=\\"https://aws.amazon.com/rds/postgresql/\\" target=\\"_blank\\">Amazon RDS for PostgreSQL</a>, <a href=\\"https://aws.amazon.com/rds/aurora/postgresql-features/\\" target=\\"_blank\\">Amazon Aurora PostgreSQL-Compatible Edition</a>, <a href=\\"https://aws.amazon.com/rds/mysql/\\" target=\\"_blank\\">Amazon RDS for MySQL</a>, and&nbsp;<a href=\\"https://aws.amazon.com/rds/aurora/faqs/\\" target=\\"_blank\\">Aurora MySQL-Compatible Edition</a>. You should also ensure that you have an [Amazon Redshift](https://aws.amazon.com/cn/redshift/?trk=cndc-detail) cluster with a cluster maintenance version that supports federated queries.</li> \\n </ul>\n<h3><a id=\\"Conclusion_86\\"></a><strong>Conclusion</strong></h3>\\n<p>In this post, we introduced Stripe Data Pipeline for Amazon Redshift and discussed options to further integrate with AWS services. Stripe Data Pipeline removes the need to build custom API integration or adopt a third-party ETL pipeline, making data accessible with a few clicks and with no code required. Businesses can automatically receive up-to-date data from Stripe in their data warehouse on AWS, reduce data silos, and extract deep insights to address business needs.</p>\n<p>Check out <a href=\\"http://stripe.com/data-pipeline\\" target=\\"_blank\\">Stripe Data Pipeline</a> for more information about the solution and how to get started.</p>\\n<h4><a id=\\"About_the_Authors_92\\"></a><strong>About the Authors</strong></h4>\\n<p><img src=\\"https://dev-media.amazoncloud.cn/eac04f27b66346e5b263ad9f8f4d883b_image.png\\" alt=\\"image.png\\" /></p>\n<p><strong>Jessica Ho</strong> is a Sr. Partner Solutions Architect at AWS supporting ISV partners who build business applications. She is passionate about creating differentiated solutions that promote cloud adoption. Outside of work, she enjoys spoiling her garden into a mini jungle.</p>\\n<p><img src=\\"https://dev-media.amazoncloud.cn/e142743d70f64899816a531a28a4979e_image.png\\" alt=\\"image.png\\" /></p>\n<p><strong>Alexander Mahabir</strong>&nbsp;is a Sr. Partner Solutions Architect at AWS based in the D.C metropolitan area. Alex has over 16 year of experience building cloud, and on-premise solutions for small, medium, and large enterprises. Alex currently works with ISV partners in the Digital Customer Experience segment.</p>"}
目录
亚马逊云科技解决方案 基于行业客户应用场景及技术领域的解决方案
联系亚马逊云科技专家
亚马逊云科技解决方案
基于行业客户应用场景及技术领域的解决方案
联系专家
0
目录
关闭