1. Define the notion of Big Data. In your opinion how does this notion opens new challenges to data management?
  2. Give 5 features that characterise Big Data ? Explain in which way they are challenging for managing data?
  3. Explain the notion of sequential processing in data management. Do you feel that it is still valid for Big Data management?
  4. What are the data access patterns? How are they useful ?
  5. What does SLA mean and what is its role in data management?
  6. In the case of your domain of expertise,  how does Big Data opens novel possibilities or problems/challenges?
  7. Compare the key-value, document, dimensional records and relational models. In which way are they different? What do we loose/gain having such complex/simple models? In which ways is the notion of key important in these models?
  8. Explain the notions of sharding and replication on how do they benefit Big Data management?
  9. In terms of multi-dabases, which are the challenges related to query rewriting in such setting? Give examples using the setting you illustrated in the previous question. Which are the challenges in the case of a sharding setting?
  10. Which would be the aspects to consider if you wanted to provide a global solution for updating shards that are not orthogonal (i.e., there are replicated data and there are certain semantic dependencies between shards). What about synchronizing updates?
  11. Which would be the role and/or benefit of the cloud with respect to Big Data management?
  12. What is the difference between classic database management systems architectures and the functions view of data management today? Refer to HDFS, structured data systems, distributed processing systems, ad hoc purpose query languages