CampusAnswers

DSCI326 — Data Science at Scale

DSCI 326 - Data Science at Scale DSCI 326 - Data Science at Scale (3 credits) The focus of this course is on exposing students to processing data on a large scale using a distributed platform. First, students will learn the functional approach to processing large data sets. Along the way, we will encounter many of the techniques that are employed in large distributed data-processing systems, such as using common higher-order functions, employing lazy evaluation, and relying on immutable data structures. By the end of the course students will be familiar with processing large amounts of data in one or more high-level languages (e.g. Python and/or Scala) and working with a number of frameworks for distributed computation (e.g. Hadoop/MapReduce/Spark). Grade or P/NC. alternate years. Course Registration

Prerequisites: DSCI325

Part of

Source ↗

← back to winona catalog