Your company’s customer and order databases are often under heavy load. This makes performing analytics against them difficult without harming operations.
The databases are in a MySQL cluster, with nightly backups taken using mysqldump. You want to perform analytics with minimal impact on operations. What should you do?
A. Add a node to the MySQL cluster and build an OLAP cube there.
B. Use an ETL tool to load the data from MySQL into Google BigQuery.
C. Connect an on-premises Apache Hadoop cluster to MySQL and perform ETL.
D. Mount the backups to Google Cloud SQL, and then process the data using Google Cloud Dataproc.
D is right because performing ETL will add load on MySQL Server. Also since backups are already mentioned in the question, they just need to be restored into CLoudSQL for Analysis.
I think answer is D
Right using OLAP cube in BigQuery will be expensive. So A is correct.
I disagree with myself, this is from a friends notes:
MYSQL
– You can easily add nodes to MySql Cluster Data Nodes and build cube to do
OLAP. (answer for a mock question)
I say B, the rest do not make much sense. But a better one could be start a cloudSQL using the nightly backup and perform the querys there…might be even cheaper than the big query requests…