{"id":10,"date":"2014-11-19T22:20:38","date_gmt":"2014-11-19T22:20:38","guid":{"rendered":"http:\/\/vargas-solar.com\/bigdata-fest\/?page_id=10"},"modified":"2016-03-07T20:56:12","modified_gmt":"2016-03-07T20:56:12","slug":"practice","status":"publish","type":"page","link":"http:\/\/vargas-solar.com\/bigdata-fest\/practice\/","title":{"rendered":"HANDS ON"},"content":{"rendered":"<h3><strong>NoSQL data stores: expressing queries using MapReduce<\/strong><\/h3>\n<ul>\n<li>Downloading Couch:\u00a0<a href=\"http:\/\/couchdb.apache.org\/\">http:\/\/couchdb.apache.org<\/a>\u00a0 \u00a0[<a href=\"http:\/\/vargas-solar.com\/bigdata-fest\/wp-content\/uploads\/sites\/33\/2014\/11\/curl.zip\">cURL for Windows<\/a>]\n<ol>\n<li>Building a document database: using CouchDB [<a href=\"http:\/\/vargas-solar.com\/bigdata-management\/wp-content\/uploads\/sites\/31\/2013\/11\/Ex1-2Do2HandIn-noSQL.pdf\">Ex-1<\/a>] [<a href=\"http:\/\/vargas-solar.com\/bigdata-management\/wp-content\/uploads\/sites\/31\/2013\/11\/Ex1-2Do2HandIn-noSQL-corrige.pdf\">Ex1-answers<\/a>]<\/li>\n<li>Querying a document database [<a href=\"http:\/\/vargas-solar.com\/bigdata-management\/wp-content\/uploads\/sites\/31\/2013\/11\/Ex2-ToDo-ToHandIn.pdf\">Ex-2<\/a>] [<em>answers on explicit demand<\/em>]<\/li>\n<\/ol>\n<\/li>\n<\/ul>\n<div class=\"entry-content\">\n<h3>Sharding a data for balancing loads and ensuring availability<\/h3>\n<ul>\n<li>Sharding MongoBD\n<ul>\n<li>Exercise\u00a0[<a href=\"http:\/\/vargas-solar.com\/data-management-services-cloud\/wp-content\/uploads\/sites\/32\/2015\/01\/Ex1-2Do2Handin.pdf\">Ex1-2Do2Handin<\/a>] [<a href=\"http:\/\/vargas-solar.com\/data-management-services-cloud\/wp-content\/uploads\/sites\/32\/2014\/01\/cities.txt\">cities<\/a>]<\/li>\n<li>Mongo reference guide\u00a0[<a href=\"http:\/\/vargas-solar.com\/data-management-services-cloud\/wp-content\/uploads\/sites\/32\/2014\/01\/MongoDB-sharding-guide.pdf\">MongoDB-sharding-guide<\/a>]<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<\/div>\n<h3><strong>\u00a0Data sanitation with Pig<\/strong><\/h3>\n<ul>\n<li>Installing Pig\n<ol>\n<li><span style=\"line-height: 1.714285714; font-size: 1rem;\">Hortonworks [<\/span><a style=\"line-height: 1.714285714; font-size: 1rem;\" href=\"http:\/\/vargas-solar.com\/bigdata-fest\/wp-content\/uploads\/sites\/33\/2014\/11\/PIG-HortonWorks.pdf\">pdf<\/a><span style=\"line-height: 1.714285714; font-size: 1rem;\">]<\/span><\/li>\n<li><span style=\"font-size: 1rem; line-height: 1.714285714;\">Testing your installation: [<a href=\"http:\/\/vargas-solar.com\/bigdata-fest\/wp-content\/uploads\/sites\/33\/2014\/11\/publications.csv\">data<\/a>] [<a href=\"http:\/\/vargas-solar.com\/bigdata-fest\/wp-content\/uploads\/sites\/33\/2014\/11\/TestScript.pig_.zip\">PigScript<\/a>]<\/span><\/li>\n<\/ol>\n<\/li>\n<li>Dealing with network behavior data collections [<a href=\"http:\/\/vargas-solar.com\/bigdata-fest\/wp-content\/uploads\/sites\/33\/2014\/11\/An\u00e1lisis-de-grandes-colecciones-de-datos-con-Pig-Latin.pdf\">pdf<\/a>]\n<ul>\n<li><strong><a href=\"https:\/\/drive.google.com\/file\/d\/0BxC8-b_J_afKU1daMW1ZMncxY1U\/view?usp=sharing\">data<\/a><\/strong><\/li>\n<li>[<a href=\"http:\/\/vargas-solar.com\/bigdata-fest\/wp-content\/uploads\/sites\/33\/2014\/11\/neubot.pig_.zip\">neubot.pig (zip)<\/a>] [<a href=\"http:\/\/vargas-solar.com\/bigdata-fest\/wp-content\/uploads\/sites\/33\/2014\/11\/NeubotTestsUDFs.jar_.zip\">NeubotTestsUDFs.jar (zip)<\/a>]<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<h3><strong>Data analytics\u00a0with\u00a0Hadoop<\/strong><\/h3>\n<ul>\n<li>Environment: hadoop on Hortonworks<\/li>\n<li>Counting words and other summarization challenges [<a href=\"http:\/\/vargas-solar.com\/bigdata-fest\/wp-content\/uploads\/sites\/33\/2014\/11\/AllData.zip\">AllData<\/a>]\n<ol>\n<li>Counting words: first approach\u00a0 [\u00a0<a href=\"http:\/\/vargas-solar.com\/map-reduce-fest\/wp-content\/uploads\/sites\/21\/2013\/04\/running-the-wordcount-example.pdf\">pdf<\/a>\u00a0] [<a href=\"http:\/\/vargas-solar.com\/bigdata-fest\/wp-content\/uploads\/sites\/33\/2014\/11\/WordCount-Example.zip\">WordCount Example<\/a>]<\/li>\n<li>Counting with some optimizations using combiners: understanding some principles of the map reduce model [\u00a0<a href=\"http:\/\/vargas-solar.com\/map-reduce-fest\/wp-content\/uploads\/sites\/21\/2013\/04\/hoop-ex-1.pdf\">pdf<\/a>\u00a0] [<a href=\"http:\/\/vargas-solar.com\/bigdata-fest\/wp-content\/uploads\/sites\/33\/2014\/11\/MapReduce-book-final.pdf\">MapReduce-book-final<\/a>] [<a href=\"http:\/\/vargas-solar.com\/bigdata-fest\/wp-content\/uploads\/sites\/33\/2014\/11\/code.zip\">code examples<\/a>]<\/li>\n<\/ol>\n<\/li>\n<li>Some interesting map reduce patterns: <span style=\"color: #0000ff;\"><em><strong>see the challenges section<\/strong><\/em><\/span> [<a href=\"http:\/\/vargas-solar.com\/bigdata-fest\/wp-content\/uploads\/sites\/33\/2014\/11\/MapReduce-Design-Patterns-V413HAV.pdf\">patterns reference<\/a>]<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>NoSQL data stores: expressing queries using MapReduce Downloading Couch:&nbsp;http:\/\/couchdb.apache.org&nbsp; &nbsp;[cURL for Windows] Building a document database: using CouchDB [Ex-1] [Ex1-answers] Querying a document database [Ex-2] [answers on explicit demand] Sharding a data for balancing loads and ensuring availability Sharding MongoBD Exercise&nbsp;[Ex1-2Do2Handin] [cities] Mongo reference guide&nbsp;[MongoDB-sharding-guide] &nbsp;Data sanitation with Pig Installing Pig Hortonworks [pdf] Testing your [&hellip;]<\/p>\n","protected":false},"author":11,"featured_media":0,"parent":0,"menu_order":2,"comment_status":"closed","ping_status":"closed","template":"page-templates\/full-width.php","meta":{"footnotes":""},"class_list":["post-10","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"http:\/\/vargas-solar.com\/bigdata-fest\/wp-json\/wp\/v2\/pages\/10","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/vargas-solar.com\/bigdata-fest\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"http:\/\/vargas-solar.com\/bigdata-fest\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"http:\/\/vargas-solar.com\/bigdata-fest\/wp-json\/wp\/v2\/users\/11"}],"replies":[{"embeddable":true,"href":"http:\/\/vargas-solar.com\/bigdata-fest\/wp-json\/wp\/v2\/comments?post=10"}],"version-history":[{"count":29,"href":"http:\/\/vargas-solar.com\/bigdata-fest\/wp-json\/wp\/v2\/pages\/10\/revisions"}],"predecessor-version":[{"id":191,"href":"http:\/\/vargas-solar.com\/bigdata-fest\/wp-json\/wp\/v2\/pages\/10\/revisions\/191"}],"wp:attachment":[{"href":"http:\/\/vargas-solar.com\/bigdata-fest\/wp-json\/wp\/v2\/media?parent=10"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}