{"id":17,"date":"2025-04-19T10:50:20","date_gmt":"2025-04-19T10:50:20","guid":{"rendered":"http:\/\/vargas-solar.com\/datsyens\/?page_id=17"},"modified":"2025-04-19T11:04:32","modified_gmt":"2025-04-19T11:04:32","slug":"content","status":"publish","type":"page","link":"http:\/\/vargas-solar.com\/datsyens\/content\/","title":{"rendered":"CONTENT"},"content":{"rendered":"\n<h1 class=\"wp-block-heading\">Week 1<\/h1>\n\n\n\n<ol class=\"wp-block-list\">\n<li><strong>Introduction\u00a0: Systems Thinking in Data-Driven Engineering<\/strong><\/li>\n<\/ol>\n\n\n\n<ol class=\"wp-block-list\">\n<li>Modern analytical architectures<\/li>\n\n\n\n<li>Mapping data pipelines to use cases<\/li>\n\n\n\n<li>Assessment: Concept map of a smart engineering system<\/li>\n<\/ol>\n\n\n\n<p>2. <strong>Data Management and Scalable Engineering Systems<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>3h<\/strong>&nbsp;\u2013 Architectures: Data lakes, Delta Lake, streaming vs batch<\/li>\n\n\n\n<li><strong>3h<\/strong>&nbsp;\u2013 Tools: SQL, NoSQL, Delta tables, Parquet, ingestion with Kafka\/MQTT<\/li>\n\n\n\n<li><strong>3h<\/strong>&nbsp;\u2013 Practice: Build a data pipeline using Pandas, SQL, and Kafka<\/li>\n\n\n\n<li><strong>3h<\/strong>&nbsp;\u2013 Capstone Kickoff: Define project goal, dataset, and tech stack<\/li>\n<\/ul>\n\n\n\n<h4 class=\"wp-block-heading\">Key Outcomes:<\/h4>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Design data pipelines for high-volume engineering datasets<\/li>\n\n\n\n<li>Efficient querying using SQL-like interfaces<\/li>\n\n\n\n<li>Launch capstone project with clear objectives<\/li>\n<\/ul>\n\n\n\n<h1 class=\"wp-block-heading\">Week 2&nbsp;<\/h1>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>3.\u00a0<\/strong><strong>Data Processing, Querying, and Feature Engineering<\/strong><\/li>\n\n\n\n<li><strong>3h<\/strong>\u00a0\u2013\u00a0High Volume\u00a0Distributed processing (Dask, Spark), time-windowed analytics<\/li>\n\n\n\n<li><strong>3h<\/strong>\u00a0\u2013 Advanced data cleaning, temporal aggregation, joins<\/li>\n\n\n\n<li><strong>3h<\/strong>\u00a0\u2013 Practice: Use Dask\/PySpark on sensor &amp; spatial data<\/li>\n\n\n\n<li><strong>3h<\/strong>\u00a0\u2013 Capstone: Build pipeline skeleton, run initial EDA<ul><li>&#8211; Satellite\/open data integration<\/li><\/ul><ul><li>&#8211; Tools: GeoPandas, Rasterio<\/li><\/ul>\n<ul class=\"wp-block-list\">\n<li>&#8211; Assessment: Remote sensing notebook + map visuals<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n\n\n\n<h4 class=\"wp-block-heading\">Key Outcomes:<\/h4>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Handle large time-series and sensor data efficiently<\/li>\n\n\n\n<li>Engineer features for machine learning and forecasting<\/li>\n\n\n\n<li>Develop reproducible workflows for data prep<\/li>\n<\/ul>\n\n\n\n<h1 class=\"wp-block-heading\">Week 3<\/h1>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>4. Analytics, AI Models, and Scalable Inference<\/strong><\/li>\n\n\n\n<li><strong>3h<\/strong>\u00a0\u2013 ML\/DL model selection: forecasting, classification, clustering<\/li>\n\n\n\n<li><strong>3h<\/strong>\u00a0\u2013 MLOps: model versioning, training pipelines, drift detection<\/li>\n\n\n\n<li><strong>3h<\/strong>\u00a0\u2013 Practice: Train &amp; log ML models using MLflow; deploy via FastAPI<\/li>\n\n\n\n<li><strong>3h<\/strong>\u00a0\u2013 Capstone: Model training &amp; first deployment demo<\/li>\n<\/ul>\n\n\n\n<h4 class=\"wp-block-heading\">Key Outcomes:<\/h4>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Implement and track scalable machine learning workflows<\/li>\n\n\n\n<li>Set up real-time or batch inference systems<\/li>\n\n\n\n<li>Translate data science models into deployed services<\/li>\n<\/ul>\n\n\n\n<h1 class=\"wp-block-heading\">Week 4<\/h1>\n\n\n\n<p><strong>5. Real-Time Decision Systems and Final Project Delivery<\/strong><\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>3h<\/strong>\u00a0\u2013 Decision systems: optimization, control, dashboards<\/li>\n\n\n\n<li><strong>3h<\/strong>\u00a0\u2013 Cloud deployment (Colab, AWS, Streamlit, Docker)<\/li>\n\n\n\n<li><strong>3h<\/strong>\u00a0\u2013 Practice: Build dashboard + run cloud deployment<\/li>\n\n\n\n<li><strong>3h<\/strong>\u00a0\u2013 Capstone: Final testing + presentations<\/li>\n<\/ul>\n\n\n\n<h4 class=\"wp-block-heading\">Key Outcomes:<\/h4>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Integrate analytics into actionable, real-time dashboards<\/li>\n\n\n\n<li>Complete and deploy full-stack capstone project<\/li>\n\n\n\n<li>Deliver technical presentation and documentation<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\"><strong>Capstone Summary<\/strong><\/h2>\n\n\n\n<p>Deliverables by end of Week 4:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>A GitHub repo with data pipeline, model, and deployment scripts<\/li>\n\n\n\n<li>Streamlit\/FastAPI interface or cloud-hosted dashboard<\/li>\n\n\n\n<li>Documentation + 10-min presentation (recorded or live)<\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>Week 1 2. Data Management and Scalable Engineering Systems Key Outcomes: Week 2&nbsp; Key Outcomes: Week 3 Key Outcomes: Week 4 5. Real-Time Decision Systems and Final Project Delivery Key Outcomes: Capstone Summary Deliverables by end of Week 4:<\/p>\n","protected":false},"author":11,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"page-templates\/full-width.php","meta":{"footnotes":""},"class_list":["post-17","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"http:\/\/vargas-solar.com\/datsyens\/wp-json\/wp\/v2\/pages\/17","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/vargas-solar.com\/datsyens\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"http:\/\/vargas-solar.com\/datsyens\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"http:\/\/vargas-solar.com\/datsyens\/wp-json\/wp\/v2\/users\/11"}],"replies":[{"embeddable":true,"href":"http:\/\/vargas-solar.com\/datsyens\/wp-json\/wp\/v2\/comments?post=17"}],"version-history":[{"count":3,"href":"http:\/\/vargas-solar.com\/datsyens\/wp-json\/wp\/v2\/pages\/17\/revisions"}],"predecessor-version":[{"id":20,"href":"http:\/\/vargas-solar.com\/datsyens\/wp-json\/wp\/v2\/pages\/17\/revisions\/20"}],"wp:attachment":[{"href":"http:\/\/vargas-solar.com\/datsyens\/wp-json\/wp\/v2\/media?parent=17"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}