Improve performance of assembleGlobalBasisTransferMatrix

added 1 commit

27404f93 - Remove tracking of processed entries in assembleGlobalBasisTransferMatrix

changed the description

With the implemented improvements, the total runtime is reduced significantly. For example, computing the P1->P2 interpolation is now more than three times faster. Since computing this interpolation accounts for a significant part of the total runtime for multigrid with p-coarsening, this also significantly improves multigrid performance.

Notice that there is potential for more improvements:

- Don't reallocate containers used in the inner loops.
- Cache evaluation of coarse basis functions.
- Cache interpolation points of fine basis functions.
- Cache local interpolation matrices. E.g. for P1->P2 they are the same among all elements with the same GeometryType.
- Parallelize the element loops.
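
As an illustration of the "cache local interpolation matrices" idea, here is a minimal, self-contained sketch. It does not use the dune-functions API: `RefElement` is a hypothetical stand-in for `Dune::GeometryType`, and `LocalInterpolationCache`, `computeLocalP1ToP2`, `p2Nodes` and `evaluateP1` are made-up names. The local node ordering (vertices first, then edge midpoints) is also an assumption. The point is only that the local P1->P2 matrix is computed once per reference element type and then reused for every element of that type.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <map>
#include <vector>

// Hypothetical stand-in for Dune::GeometryType (only two types here).
enum class RefElement { Line, Triangle };

// Rows: fine (P2) local dofs, columns: coarse (P1) local dofs.
using LocalMatrix = std::vector<std::vector<double>>;

// P2 Lagrange nodes in reference coordinates (assumed ordering:
// vertices first, then edge midpoints; second coordinate unused for Line).
std::vector<std::array<double, 2>> p2Nodes(RefElement type)
{
  if (type == RefElement::Line)
    return {{0.0, 0.0}, {0.5, 0.0}, {1.0, 0.0}};
  return {{0.0, 0.0}, {1.0, 0.0}, {0.0, 1.0},
          {0.5, 0.0}, {0.0, 0.5}, {0.5, 0.5}};
}

// Values of all P1 basis functions at the reference point x.
std::vector<double> evaluateP1(RefElement type, const std::array<double, 2>& x)
{
  if (type == RefElement::Line)
    return {1.0 - x[0], x[0]};
  return {1.0 - x[0] - x[1], x[0], x[1]};
}

// Local interpolation matrix: coarse basis evaluated at the fine nodes.
LocalMatrix computeLocalP1ToP2(RefElement type)
{
  LocalMatrix matrix;
  for (const auto& node : p2Nodes(type))
    matrix.push_back(evaluateP1(type, node));
  assert(!matrix.empty());
  return matrix;
}

// Computes each local matrix at most once per reference element type.
struct LocalInterpolationCache
{
  std::map<RefElement, LocalMatrix> cache;
  std::size_t computations = 0; // counts cache misses, for illustration

  const LocalMatrix& get(RefElement type)
  {
    auto it = cache.find(type);
    if (it == cache.end()) {
      ++computations;
      it = cache.emplace(type, computeLocalP1ToP2(type)).first;
    }
    return it->second;
  }
};
```

In an element loop one would call `cache.get(type)` for each element and scatter the returned local matrix into the global one. This only works when the local matrix really is independent of the element, as stated above for P1->P2; bases whose interpolation depends on the element geometry would need a different cache key.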

added 1 commit

ab970652 - [cleanup] Remove some no longer used code

changed the description

mentioned in commit fd02899e

merged

Merged by Carsten Gräser 2 months ago (Jan 26, 2025 7:40pm UTC)
