TA’s comment: An excellent start Group 3. Thanks. Very through examination. One thing I find confusing. Your section 3 opens with “This section will explore the advantages and disadvantages of two data lake architectures: Apache Hadoop and Microsoft Azure” Neither one of those is really a data lake. Azure is a cloud platform. Your Section 3 is fine. Great, in fact. The opening sentence is off. Recommended fix is on the sentence not the section. Also, Uber is not public yet.

My group member’s email: I agree – I posted below the requirements from Canvas and below it, the additional pieces that Mark sent in his follow-up note.
I think we need we’re definitely on track but seems like a few pieces we may need to add/amend.
Requirements on Canvas
1. Database Engine – I think we covered this section well
o Compare Risk/Benefits of at least two Database Engines
o Speak to how the chosen engine will support the business needs
2. Data Lake Components
o Data Lake Architecture – I think we covered this section well
 Compare Risk/Benefits of at least two Data Lake configurations
 Speak to how the chosen Configuration will support the business needs
o Recovery/Continuity of Business- I’m not sure we sufficiently covered this. What do you guys think? See below components
 Compare Risk/Benefits of at least two Data Lake configurations
 Speak to how the chosen Configuration will support the business needs
3. Data Governance- Did we cover this piece?

Additional Guidance from Mark’s email
I think the two items in yellow we need to cover. I think we can rework the Business Requirements and use it towrds the Executive Summary. I can take a stab on that.

1. Open with and Executive Summary – I think we can rework what is now “Business Case” to be Executive Summary. It’s mean to basically be the very short version of the entire paper so that if someone only had time to read that, they’d get the entire gist. — I will take a stab at that.
2. Start with the Business Requirements- Business Requirements – I think we include pieces of this in the tables we have, but guess he also wants it spelled out a bit more – who can take a stab at this?
3. What is the nature of the data
4. What else can you think of ?
5. Evaluate the pros/cons of each engine
6. Differential diagnosis. Consider and discuss at least two
7. Put together your recommendation. Connect requirements to engines
8. Explain (diagram) and be prepared to defend

Leave a Reply

Your email address will not be published. Required fields are marked *