Форма обучения:
дистанционная
Стоимость самостоятельного обучения:
бесплатно
Доступ:
свободный
Документ об окончании:
 
Уровень:
Специалист
Длительность:
14:25:00
Студентов:
288
Выпускников:
18
Качество курса:
4.00 | 4.20
The course concentrates mostly on application performance improvements with Intel Compiler and VTune Amplifier.
It briefly describes microprocessor architecture; application performance factors and common speedup techniques: scalar optimizations, loop optimizations, vectorization, parallelization, interprocedural optimizations and profiler guide optimizations. The course describes compiler architecture and command line options, compiler limitations and methods of providing additional information to the compiler. It gives a first insight to the performance analysis. Practical examples help to become familiar with VTune usage and the ideas of performance optimization.
Специальности: Программист
 

План занятий

Занятие
Заголовок <<
Дата изучения
Лекция 2
17 минут
Intel® performance analyze tools
Second lecture briefly describes important performance tool VTune Amplifier and describes the main ideas of its usage; the common scheme of performance tuning; VTune graphical interface; the main analysis techniques and their implementation at VTune.
-
Лекция 3
42 минуты
Optimizing compiler Scalar optimizations
At this lecture Intel Compilers approximate compilation scheme is given. Role of the frontend and the internal representation of the compiler. Control flow graph and its importance for the analysis. The basic scalar optimizations. The notion of the static single assignment form.
-
Тест 3
42 минуты
-
Лекция 4
43 минуты
Optimizing compiler. Loop optimizations
This lecture is devoted to the loops and loop optimizations. Discussed topics are: the problem of classification and loop recognition; the reasons for loop optimization precise study; common loop optimizations; optimization examples; the notion of dependency and permutation; optimization admissibility criterion in terms of dependencies; compiler command line options.
-
Лекция 6
59 минут
Optimizing compiler. Auto parallelization
The lecture describes main features of the multiprocessor and multicore computing systems, pros and cons of the multithreaded applications. Auto parallelization as the simple method for multi-threaded application creation. Compiler command line options. Some of the language extensions used for parallelization Manual and automatic prefetching.
-
Тест 6
33 минуты
-
Лекция 7
14 минут
OpenMP fundamentials
In this lecture describes OpenMP basics. OpenMP workflow and its limitation. Different variables and sections of the code, critical and concurrent sections, loop parallelization directives, loop iteration scheduling.
-
Лекция 9
51 минута
Optimizing compiler. Static and dynamic profiler. Memory manager. Code generator
This lecture describes the roles of the profiler, memory manager and code generator. Principles of dynamic profiler usage. The difference between static and dynamic profiling. Memory distribution and allocation issues. Memory manager affection on application performance. Register allocation and instruction scheduling.
-
Тест 9
24 минуты
-
1 час 40 минут
-