Mathematical Tools for Big Data

Information

Teachers: Eugénio Rocha

Duration: One semester

Work hours: 162

Contact hours: 45

ECTS: 6

Scientific area: Mathematics

Objectives

Technological advances that have taken place in recent decades have provided a capacity never before possible for storing and making information available. This curricular unit has as its main objective the contact with some mathematical techniques dedicated to the treatment of these large volumes of data, whose study is vital to human activity.

Learning Outcomes

After finishing this course, students should be able to use numerical optimization methods in large-scale problems, apply dimensionality reduction techniques and aggregation methodologies, deal with concepts of information and entropy in inference, analyze large-dimensional graphs , as well as using computational learning techniques (machine learning) suitable for large-scale problems. Students are also expected to be able to interpret and communicate technical results in any intercultural environment.

Grading

Grading will consist of the presentation and discussion, in class, of a work (50%) and the completion of a written exam (50%).

Methodology

Classes take place in rooms equiped with computers. Special emphasis will be given to the presentation of techniques, algorithms and software (MATLAB, R and Python). Autonomy in solving proposed problems will be strongly encouraged.

Syllabus

Numerical optimization methods in large problems
Dimensionality reduction
Aggregation procedures on homogeneous and non-homogeneous data
Info-Metrics (information, maximum entropy and inference)
Regression, classification and clustering algorithms for large problems
Large-dimensional graph analysis (connectivity, centrality, paths).