- Define the notion of Big Data. In your opinion, how does this notion open new challenges for data management?
- Give 5 features that characterise Big Data. Explain in what way each makes managing data challenging.
- Explain the notion of sequential processing in data management. Do you feel that it is still valid for Big Data management?
- What are data access patterns? How are they useful?
- What does SLA mean and what is its role in data management?
- In your domain of expertise, how does Big Data open novel possibilities or problems/challenges?
- Compare the key-value, document, dimensional records and relational models. In which ways are they different? What do we lose/gain by having such complex/simple models? In which ways is the notion of key important in these models?
- Explain the notions of sharding and replication. How do they benefit Big Data management?
- In terms of multi-databases, what are the challenges related to query rewriting in such a setting? Give examples using the setting you illustrated in the previous question. What are the challenges in the case of a sharding setting?
- What is a sharding key? Is the choice of a sharding key directly dependent on the sharding strategy? Explain and give examples.
- What aspects would you consider if you wanted to provide a global solution for updating shards that are not orthogonal (i.e., there are replicated data and there are certain semantic dependencies between shards)? What about synchronizing updates?
- What would be the role and/or benefit of the cloud with respect to Big Data management?