"Deep learning" is shorthand for the creation of a function F(x, v) so that the inputs v (the training data) produce correct outputs. So it is a new type of interpolation. The mathematics is a combination of linear algebra and calculus (optimizing the weights) and statistics (controlling the variance). The18.065 course at MIT has become a "second course in linear algebra" for students from all departments and all years. It has a textbook, Linear Algebra and Learning from Data. Video lectures are on OpenCourseWare. The key link from linear algebra to data science is the Singular Value Decomposition. It has become the foundation of applied linear algebra and we need to teach it.