Innovation & Tech (CTO) Curriculum Electives

Stars, Flakes, Vaults and the Sins of Denormalisation

2019-10-18T03:01:05+00:00Categories: Data Governance Level 2, Innovation & Tech (CTO) Curriculum Electives, Data Governance Curriculum Electives, Executive Curriculum Electives, Innovation & Tech (CTO) Level 2, Stephen Brobst, Data Engineering Curriculum, Data Management, AI Engineering Curriculum, Executive Level 2, Data Engineering Level 1, AI Engineering Level 1, All Academy Courses|Tags: , , , |

Providing both performance and flexibility are often seen as contradictory goals in designing large scale data implementations. In this talk we will discuss techniques for denormalisation and provide a framework for understanding the performance and flexibility implications of various design options. We will examine a variety of logical and physical design approaches and evaluate the trade offs between them. Specific recommendations are made for guiding the translation from a normalised logical data model to an engineered-for-performance physical data model. The role of dimensional modeling and various physical design approaches are discussed in detail. Best practices in the use of surrogate keys is also discussed. The focus is on understanding the benefit (or not) of various denormalisation approaches commonly taken in analytic database designs.

Best Practices in Enterprise Information Management

2019-10-24T04:45:22+00:00Categories: Data Culture Level 1, Data Culture Curriculum, Innovation & Tech (CTO) Curriculum Electives, Data Governance Curriculum, Stephen Brobst, Fraud and Security, Executive Curriculum, Data Engineering Curriculum, Data Governance Level 1, Data Management, Executive Level 2, Big Data, Data Engineering Level 1, All Academy Courses, Innovation & Tech (CTO) Level 3|Tags: , , , , , |

The effective management of enterprise information for analytics deployment requires best practices in the areas of people, processes, and technology. In this talk we will share both successful and unsuccessful practices in these areas. The scope of this workshop will involve five key areas of enterprise information management: (1) metadata management, (2) data quality management, (3) data security and privacy, (4) master data management, and (5) data integration.

Overcoming Information Overload with Advanced Practices in Data Visualisation

2019-10-24T04:46:56+00:00Categories: Data Culture Electives, Innovation & Tech (CTO) Curriculum Electives, Data Science Curriculum, Data Science Level 1, Data Culture Level 2, Innovation & Tech (CTO) Level 2, Stephen Brobst, Executive Curriculum, Data Visualisation, Data Management, AI Engineering Curriculum, Executive Level 2, Big Data, AI Engineering Level 1, All Academy Courses|Tags: , , , , , , , |

In this workshop, we explore best practices in deriving insight from vast amounts of data using visualisation techniques. Examples from traditional data as well as an in-depth look at the underlying technologies for visualisation in support of geospatial analytics will be undertaken. We will examine visualisation for both strategic and operational BI.

Advanced Deep Learning

2019-10-17T02:41:05+00:00Categories: Keras, Innovation & Tech (CTO) Curriculum Electives, Data Engineering Curriculum Electives, Tensorflow, Data Science Curriculum, Python, Dr Eugene Dubossarsky, AI Engineering Curriculum, Data Science Level 3, Data Engineering Level 3, AI Engineering Level 3, All Academy Courses, Innovation & Tech (CTO) Level 3|Tags: , |

This course provides a more rigorous, mathematically based view of modern neural networks, their training, applications, strengths and weaknesses, focusing on key architectures such as convolutional nets for image processing and recurrent nets for text and time series. This course will also include use of dedicated hardware such as GPUs and multiple computing nodes on the cloud. There will also be an overview of the most common available platforms for neural computation. Some topics touched in the introduction will be revisited in more thorough detail. Optional advanced topics may include Generative Adversarial Networks, Reinforcement Learning, Transfer Learning and probabilistic neural networks.

Cost-Based Optimisation: Obtaining the Best Execution Plan for Complex Queries

2019-10-24T04:52:00+00:00Categories: Data Governance Level 2, Predictive Analytics & AI, Innovation & Tech (CTO) Curriculum Electives, Data Science Level 2, Data Science Curriculum Electives, Data Governance Curriculum, Innovation & Tech (CTO) Level 2, Stephen Brobst, Data Engineering Curriculum, Data Engineering Level 2, AI Engineering Curriculum, Big Data, AI Engineering Level 2, All Academy Courses|Tags: , , , |

Optimiser choices in determining the execution plan for complex queries is a dominant factor in the performance delivery for a data foundation environment. The goal of this workshop is to de-mystify the inner workings of cost-based optimisation for complex query workloads. We will discuss the differences between rule-based optimisation and cost-based optimisation with a focus on how a cost-based optimization enumerates and selects among possible execution plans for a complex query. The influences of parallelism and hardware configuration on plan selection will be discussed along with the importance of data demographics. Advanced statistics collection is discussed as the foundational input for decision-making within the cost-based optimiser. Performance characteristics and optimiser selection among different join and indexing opportunities will also be discussed with examples. The inner workings of the query re-write engine will be described along with the performance implications of various re-write strategies.

Social Network Analysis: Practical Use Cases and Implementation

2019-10-24T04:51:00+00:00Categories: Data Governance Level 2, Predictive Analytics & AI, Data Culture Electives, Innovation & Tech (CTO) Curriculum Electives, Data Science Curriculum, Data Governance Curriculum Electives, Executive Curriculum Electives, Marketing, Data Science Level 1, Data Culture Level 2, Innovation & Tech (CTO) Level 2, Stephen Brobst, Fraud and Security, Data Management, AI Engineering Curriculum, Executive Level 2, Big Data, AI Engineering Level 2, All Academy Courses|Tags: , , , |

Social networking via Web 2.0 applications such as LinkedIn and Facebook has created huge interest in understanding the connections between individuals to predict patterns of churn, influencers related to early adoption of new products and services, successful pricing strategies for certain kinds of services, and customer segmentation. We will explain how to use these advanced analytic techniques with mini case studies across a wide range of industries including telecommunications, financial services, health care, retailing, and government agencies. 

Capacity Planning for Enterprise Data Deployment

2019-10-24T04:51:48+00:00Categories: Innovation & Tech (CTO) Curriculum Electives, Data Governance Curriculum Electives, Executive Curriculum Electives, Stephen Brobst, Data Engineering Curriculum, Data Governance Level 1, Data Management, AI Engineering Curriculum, Infrastructure & Technologies, Data Engineering Level 1, AI Engineering Level 1, Executive Level 1, All Academy Courses|Tags: , , , |

This workshop describes a framework for capacity planning in an enterprise data environment. We will propose a model for defining service level agreements (SLAs) and then using these SLAs to drive the capacity planning and configuration for enterprise data solutions. Guidelines will be provided for capacity planning in a mixed workload environment involving both strategic and tactical decision support. Performance implications related to technology trends in multi-core CPU deployment, large memory deployment, and high density disk drives will be described. In addition, the capacity planning implications for different approaches for data acquisition will be considered.