{"id":8,"date":"2022-01-10T13:02:50","date_gmt":"2022-01-10T13:02:50","guid":{"rendered":"http:\/\/vargas-solar.com\/cloud-bigdata\/?page_id=8"},"modified":"2022-05-21T20:56:53","modified_gmt":"2022-05-21T20:56:53","slug":"content-theory","status":"publish","type":"page","link":"http:\/\/vargas-solar.com\/cloud-bigdata\/content-theory\/","title":{"rendered":"CONTENT"},"content":{"rendered":"\n<p>Complete syllabus here: <a href=\"https:\/\/drive.google.com\/file\/d\/1tZ921RGNAEqZ8g4Iv378MnJhuyO1oius\/view?usp=sharing\" target=\"_blank\" rel=\"noreferrer noopener\">LIS-4102<\/a><\/p>\n\n\n\n<ul class=\"wp-block-list\" type=\"1\"><li><strong>Introduction: dealing with data at scale<\/strong> [<a rel=\"noreferrer noopener\" href=\"https:\/\/drive.google.com\/file\/d\/1qEA-5OY2c2yH7vZ25ak2VDNpGpuWJXzQ\/view?usp=sharing\" target=\"_blank\">slides<\/a>] [<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/N_iR0ChtrE8\" target=\"_blank\">YouTube<\/a>][<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/N_iR0ChtrE8\" target=\"_blank\">YouTube-2<\/a>]<ul><li>Datification and Data properties<\/li><li>Data-centric applications at scale<\/li><li>Computing centres: hardware and resources delivery<\/li><\/ul><\/li><\/ul>\n\n\n\n<ul class=\"wp-block-list\" id=\"block-f7103be5-1f2e-40ab-baf8-45dbaf1f3d43\"><li><strong>Distributed data management and storage\u00a0<\/strong><ul><li>Cluster based data stores [<a rel=\"noreferrer noopener\" href=\"https:\/\/drive.google.com\/file\/d\/1XsU9daMWSiB6kHfZSpBWD_9Jaxc428SN\/view?usp=sharing\" target=\"_blank\">slides<\/a>][<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/kcuEkhSW1pY\" target=\"_blank\">YouTube<\/a>] [<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/PDrzrCvRBWQ\" target=\"_blank\">YouTube-2<\/a>]<ul><li>[<a rel=\"noreferrer noopener\" href=\"http:\/\/vargas-solar.com\/cloud-bigdata\/mongodb-examples\/\" data-type=\"page\" data-id=\"79\" target=\"_blank\">MongoExamples<\/a>] [<a rel=\"noreferrer noopener\" href=\"https:\/\/drive.google.com\/file\/d\/1FH769Jn3M6C1YX6z3YAkEFHAH8I-Sbhk\/view?usp=sharing\" target=\"_blank\">slides<\/a>][<a rel=\"noreferrer noopener\" href=\"https:\/\/drive.google.com\/file\/d\/1FH769Jn3M6C1YX6z3YAkEFHAH8I-Sbhk\/view?usp=sharing\" target=\"_blank\">slides<\/a>-2][<a rel=\"noreferrer noopener\" href=\"https:\/\/drive.google.com\/file\/d\/1kFgHpMx0kgzRHsCY0u-gmRP1U7KuCvWV\/view?usp=sharing\" target=\"_blank\">slides-3<\/a>]<ul><li>Querying: [<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/Pxnm539rWoo\" target=\"_blank\">YouTube-1<\/a>] [<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/Z99ZsckPCBI\" target=\"_blank\">YouTube-2<\/a>][<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/9yGvenP5d68\" target=\"_blank\">YouTube-3<\/a>][<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/X5TO3hObd2E\" target=\"_blank\">YouTube-4<\/a>]<\/li><li>Sharding:  [<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/ytxSk1vAdxA\" target=\"_blank\">YouTube<\/a>]<\/li><\/ul><\/li><li>Graph databases [<a rel=\"noreferrer noopener\" href=\"https:\/\/drive.google.com\/file\/d\/1kFgHpMx0kgzRHsCY0u-gmRP1U7KuCvWV\/view?usp=sharing\" target=\"_blank\">slides<\/a>] [<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/kZ27meuQ1tg\" target=\"_blank\">YouTube<\/a>]<ul><li>Cypher [<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/x5dOHq-S-x0\" target=\"_blank\">YouTube<\/a>]<\/li><\/ul><ul><li>[<a rel=\"noreferrer noopener\" href=\"http:\/\/vargas-solar.com\/cloud-bigdata\/creating-and-querying-graphs-with-cypher-on-neo4j\/\" target=\"_blank\">Neo4JExampl<\/a><a href=\"http:\/\/vargas-solar.com\/cloud-bigdata\/creating-and-querying-graphs-with-cypher-on-neo4j\/\">e<\/a>]<\/li><\/ul><\/li><li>[<a href=\"http:\/\/vargas-solar.com\/cloud-bigdata\/polyglot-data-management-on-the-cloud\/\" data-type=\"page\" data-id=\"154\">Polyglot UseCase<\/a>]<\/li><li>Non-functional properties: concurrency, eventual consistency, &#8230;<\/li><\/ul><\/li><\/ul><ul><li>Distributed archival systems [<a rel=\"noreferrer noopener\" href=\"https:\/\/drive.google.com\/file\/d\/11FTXJEibVQLtp3x5QS7yxf7vL4OLzC9A\/view?usp=sharing\" target=\"_blank\">slides<\/a>]<ul><li>Distributed File Systems<\/li><li>Data Labs<\/li><li>Data Lakes<\/li><\/ul><\/li><\/ul><\/li><\/ul>\n\n\n\n<ul class=\"wp-block-list\"><li><strong>Big data processing and analysis&nbsp;<\/strong>Parallel programming models [<a href=\"https:\/\/youtu.be\/lkcAYq6s9TM\" target=\"_blank\" rel=\"noreferrer noopener\">YouTube<\/a>]&nbsp;[<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/33e0i1ErzO0\" target=\"_blank\">YouTube<\/a>][<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/ACbwKMpXWYA\" target=\"_blank\">YouTube<\/a>][<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/e4H59pcRFGU\" target=\"_blank\">YouTube<\/a>][<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/FweOR2yodGM\" target=\"_blank\">YouTube<\/a>] [<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/K7RHdeW-DS0\" target=\"_blank\">YouTube<\/a>][<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/ohYl9a02Ny8\" target=\"_blank\">YouTube<\/a>]<ul><li>Map Reduce: families of algorithms and patterns [<a rel=\"noreferrer noopener\" href=\"https:\/\/drive.google.com\/file\/d\/1XnlylDFiqu7Nk9ON4HHyTyy3pkWXhrQ-\/view?usp=sharing\" target=\"_blank\">slides &#8211; part A<\/a>] [<a rel=\"noreferrer noopener\" href=\"https:\/\/drive.google.com\/file\/d\/1SqF_8OxZhL2yI8wxVjUe1Rk_T27umNDZ\/view?usp=sharing\" target=\"_blank\">slides &#8211; part B<\/a>][<a rel=\"noreferrer noopener\" href=\"https:\/\/docs.google.com\/document\/d\/1i3FGe-zh4SRtZYtgiu3VbADkR1JMgM2ggOlm9iHBWYg\/edit?usp=sharing\" target=\"_blank\">Glossary<\/a>]<\/li><\/ul><ul><li>Data flow-based models: operators, data representation, management [<a rel=\"noreferrer noopener\" href=\"https:\/\/drive.google.com\/file\/d\/1w4iAWyrBX0G_A8shZGebnZz7WktjhThP\/view?usp=sharing\" target=\"_blank\">slides-part A<\/a>] [<a rel=\"noreferrer noopener\" href=\"https:\/\/drive.google.com\/file\/d\/18MkYyCAYZJ1JHwAEbw6AqqAPwImxrwtO\/view?usp=sharing\" target=\"_blank\">slides-part B<\/a>]<ul><li>Spark programming Use Case [<a rel=\"noreferrer noopener\" href=\"http:\/\/vargas-solar.com\/cloud-bigdata\/use-case-2-dataflow-based-programming\/\" data-type=\"page\" data-id=\"188\" target=\"_blank\">exercise]<\/a><\/li><\/ul><\/li><\/ul><\/li><\/ul>\n\n\n\n<ul class=\"wp-block-list\"><li><strong>Ecosystems for massive data management and processing<\/strong><ul><li>Virtualisation [<a rel=\"noreferrer noopener\" href=\"https:\/\/drive.google.com\/file\/d\/1YSh2aVJVVXvn84Ci9LKoadJ3QRGhJYoH\/view?usp=sharing\" target=\"_blank\">slides<\/a>][<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/K7RHdeW-DS0\" target=\"_blank\">YouTube<\/a>][<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/ohYl9a02Ny8\" target=\"_blank\">YouTube<\/a>]<\/li><\/ul><ul><li>Containers [<a rel=\"noreferrer noopener\" href=\"https:\/\/drive.google.com\/file\/d\/1TwFpvhgiWRaMYj9TZmS-s0agy6ybLZiN\/view?usp=sharing\" target=\"_blank\">slides<\/a>] [<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/vpP457y4ZUE\" target=\"_blank\">YouTube<\/a>] [<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/Ir_42zlVImM\" target=\"_blank\">YouTube<\/a>]<\/li><\/ul><ul><li>High-performance architectures: cluster, HPC, cloud, fog, edge, just in time architectures [<a rel=\"noreferrer noopener\" href=\"https:\/\/drive.google.com\/file\/d\/1hZ6a7LtduULBNiqndIGxAiuN-cGsx7j_\/view?usp=sharing\" target=\"_blank\">slides<\/a>]<\/li><\/ul><\/li><\/ul>\n\n\n\n<ul class=\"wp-block-list\"><li><strong>Perspectives: open problems and trends<\/strong> [<a rel=\"noreferrer noopener\" href=\"https:\/\/drive.google.com\/file\/d\/1EDzN9_FPXjxA_nFWQJEYD03PiSFa8lUR\/view?usp=sharing\" target=\"_blank\">slides<\/a>][<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/B84Z1zMit4A\" target=\"_blank\">YouTube-1<\/a>][<a rel=\"noreferrer noopener\" href=\"https:\/\/youtu.be\/vTMKa0C7Fic\" target=\"_blank\">YouTube-<\/a>2]<ul><li>Data processing and data management divide<\/li><\/ul><ul><li>Data processing workflows: design, test, deployment, and maintenance<\/li><\/ul><\/li><\/ul>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Complete syllabus here: LIS-4102 Introduction: dealing with data at scale [slides] [YouTube][YouTube-2] Datification and Data properties Data-centric applications at scale Computing centres: hardware and resources delivery Distributed data management and storage&nbsp; Cluster based data stores [slides][YouTube] [YouTube-2] [MongoExamples] [slides][slides-2][slides-3] Querying: [YouTube-1] [YouTube-2][YouTube-3][YouTube-4] Sharding: [YouTube] Graph databases [slides] [YouTube] Cypher [YouTube] [Neo4JExample] [Polyglot UseCase] Non-functional properties: [&hellip;]<\/p>\n","protected":false},"author":11,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"page-templates\/full-width.php","meta":{"footnotes":""},"class_list":["post-8","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"http:\/\/vargas-solar.com\/cloud-bigdata\/wp-json\/wp\/v2\/pages\/8","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/vargas-solar.com\/cloud-bigdata\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"http:\/\/vargas-solar.com\/cloud-bigdata\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"http:\/\/vargas-solar.com\/cloud-bigdata\/wp-json\/wp\/v2\/users\/11"}],"replies":[{"embeddable":true,"href":"http:\/\/vargas-solar.com\/cloud-bigdata\/wp-json\/wp\/v2\/comments?post=8"}],"version-history":[{"count":46,"href":"http:\/\/vargas-solar.com\/cloud-bigdata\/wp-json\/wp\/v2\/pages\/8\/revisions"}],"predecessor-version":[{"id":242,"href":"http:\/\/vargas-solar.com\/cloud-bigdata\/wp-json\/wp\/v2\/pages\/8\/revisions\/242"}],"wp:attachment":[{"href":"http:\/\/vargas-solar.com\/cloud-bigdata\/wp-json\/wp\/v2\/media?parent=8"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}