Block diagonal matrix factorization in linear time #575

Open
ctessum opened this issue Feb 11, 2025 · 8 comments
ctessum commented Feb 11, 2025

Is your feature request related to a problem? Please describe.

I'm trying to use IMEX to solve a reaction-advection system, where the reaction is the stiff part and the advection is the non-stiff part.
The issue is described in detail here.

Describe the solution you’d like

The state of the system is an S × N matrix, where S is the number of chemical species and N is the number of grid cells. While solving the stiff part, each column of the state matrix is independent of all the other columns (because the reactions happen within an individual grid cell), so the Jacobian is a block-diagonal matrix.
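To make that structure concrete, here is a minimal sketch (not part of the benchmark below) showing that solving each S × S diagonal block independently gives the same answer as solving the full block-diagonal system; the `+ S * I` shift is just to keep the random blocks well conditioned:

```julia
using LinearAlgebra

S, N = 3, 4                               # species per cell, number of cells
blks = [rand(S, S) + S * I for _ in 1:N]  # well-conditioned diagonal blocks
A = cat(blks...; dims=(1, 2))             # assemble the full block-diagonal matrix
B = rand(S * N)

x_full = A \ B                            # one global dense solve

x_block = similar(B)                      # N independent small solves: the O(N) path
for i in 1:N
    rows = (i - 1) * S + 1 : i * S
    x_block[rows] = blks[i] \ B[rows]
end

x_block ≈ x_full                          # true: the columns really are independent
```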

Because all the grid cells are independent, the solve time should theoretically scale linearly with the number of grid cells. However, the linear solve part doesn't scale linearly:

using LinearSolve
using BlockBandedMatrices
using ProgressLogging
using LinearAlgebra
using Plots

ngrid = Int.(round.((10.0 .^ collect(0.5:0.2:4)) ./ 2)) .* 2  # even sizes from ~4 to 10^4
ts = []
@progress for N in ngrid
    B = rand(3 * N)                  # right-hand side
    b = repeat([3], N)               # block sizes: N blocks of size 3
    x = rand(N * 3, N * 3)
    A = BlockBandedMatrix{eltype(x)}(x, b, b, (0, 0))  # block bandwidths (0, 0): block diagonal
    prob = LinearProblem(A, B)
    t = @elapsed solve(prob, LUFactorization())
    push!(ts, t)
end

plot(ngrid, ts, xaxis=:log, yaxis=:log, xlabel="N", legend=:topleft,
    label="Block Diagonal Matrix", ylabel="Time (s)")

plot!(ngrid, ts[3] .* ngrid ./ ngrid[3],
    label="Linear Scaling")

[Figure: log-log plot of solve time vs. N for the BlockBandedMatrix solve, compared against a linear-scaling reference line]

The red line is linear scaling, but as you can see the linear solve with the block banded matrix is more like quadratic.

So I'd like it to scale linearly, and in general to be as fast as possible in this case.

Describe alternatives you’ve considered

If we use a BandedMatrix instead of a BlockBandedMatrix, we do get linear scaling, which is the green line in the figure above:

using BandedMatrices

ts = []
@progress for N in ngrid
    B = rand(3 * N)
    b = repeat([3], N)
    x = rand(N*3, N*3)
    A = BandedMatrix{eltype(x)}(x, (2, 2))  # bandwidths (2, 2) cover the 3×3 blocks
    prob = LinearProblem(A, B)
    t = @elapsed solve(prob, LUFactorization())
    push!(ts, t)
end

plot!(ngrid, ts, label="Banded Matrix")

However, the extra zeros in the banded matrix mean that roughly 2× as many computations are performed compared to what is strictly necessary. This may be significant when the number of chemical species S is large.
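That rough factor can be sketched by counting stored entries (a storage comparison only, ignoring factorization fill-in): a banded matrix wide enough to cover S × S diagonal blocks needs bandwidths (S-1, S-1), i.e. 2S-1 stored diagonals, versus S entries per row for the blocks themselves, so the ratio (2S-1)/S approaches 2 as S grows:

```julia
S, N = 3, 100
block_entries  = N * S^2                    # N dense S×S diagonal blocks
banded_entries = (2 * (S - 1) + 1) * S * N  # 2S-1 stored diagonals of length S*N
banded_entries / block_entries              # (2S-1)/S ≈ 1.67 for S = 3, → 2 as S grows
```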

Additional context

I'm told there is a GPU kernel for this here.

@oscardssmith
Contributor

You likely want to use https://github.com/JuliaArrays/BlockDiagonals.jl. That said, in the future, the Julia GitHub is for bugs in Julia, not for performance advice about the ecosystem. For that, you should post on Discourse or the Julia Slack.

@ctessum
Author

ctessum commented Feb 11, 2025

Thank you for your response, and I appreciate your work in maintaining this repository. I understand that there are different ways to interpret the purpose of GitHub issues, but I am attempting to request a feature, and as such I chose the "feature request" issue template provided by the repository. I began by creating a post on Discourse, which is here, and the ensuing discussion resulted in @ChrisRackauckas suggesting that I open an issue here, which I have now done.

Thanks for suggesting BlockDiagonals.jl. I tried it out, and this is the result I get:

using LinearSolve
using BlockDiagonals
using ProgressLogging
using LinearAlgebra
using Plots

ngrid = Int.(round.((10.0 .^ collect(0.5:0.2:3.0)) ./ 2)) .* 2
ts = []
@progress for N in ngrid
    B = rand(3 * N)                              # right-hand side
    A = BlockDiagonal([rand(3, 3) for i in 1:N]) # N independent 3×3 blocks
    prob = LinearProblem(A, B)
    t = @elapsed solve(prob, LUFactorization())
    push!(ts, t)
end


plot(ngrid, ts, xaxis=:log, yaxis=:log, xlabel="N", legend=:topleft,
    label="Block Diagonal Matrix", ylabel="Time (s)")

plot!(ngrid, ts[3] .* ngrid ./ ngrid[3],
    label="Linear Scaling")

[Figure: log-log plot of solve time vs. N for the BlockDiagonal solve, compared against a linear-scaling reference line]

@oscardssmith
Contributor

oscardssmith commented Feb 11, 2025

Sorry for closing. I somehow misread and thought that this issue was on the Julia repository rather than the LinearSolve one. This is very much in the right place.

@oscardssmith oscardssmith reopened this Feb 11, 2025
@ctessum
Author

ctessum commented Feb 11, 2025

No problem!

@ChrisRackauckas
Member

It looks like BlockDiagonals.jl does not overload LU: https://github.com/JuliaArrays/BlockDiagonals.jl/blob/master/src/linalg.jl#L175-L188

It wouldn't be too hard to take that code and write an `lu` and `ldiv!` for that type though, looping through the blocks the same way that is currently done in that dispatch.
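A minimal sketch of that idea, assuming BlockDiagonals.jl's exported `BlockDiagonal` type and `blocks` accessor; the names `BlockDiagonalLU`, `blockdiag_lu`, and `blockdiag_ldiv!` are hypothetical, not package API, and a real implementation would overload `LinearAlgebra.lu` and `LinearAlgebra.ldiv!` with the same loops:

```julia
using LinearAlgebra
using BlockDiagonals

struct BlockDiagonalLU{F<:LU}
    factors::Vector{F}
end

# Factor each diagonal block independently -- O(N) in the number of blocks.
blockdiag_lu(B::BlockDiagonal) = BlockDiagonalLU([lu(blk) for blk in blocks(B)])

# Solve in place, one block at a time, walking down the right-hand side.
function blockdiag_ldiv!(F::BlockDiagonalLU, b::AbstractVector)
    offset = 0
    for f in F.factors
        n = size(f, 1)
        ldiv!(f, view(b, offset+1:offset+n))
        offset += n
    end
    return b
end
```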

@oscardssmith
Contributor

It could even multithread the blocks for extra speed.
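A sketch of what the threaded factorization could look like, assuming the blocks are held as a plain vector of matrices (the name `threaded_block_lu` is hypothetical); each iteration touches only its own block, so no synchronization is needed:

```julia
using LinearAlgebra

# Factor a vector of independent diagonal blocks in parallel.
function threaded_block_lu(blks::Vector{<:AbstractMatrix})
    factors = Vector{LU}(undef, length(blks))
    Threads.@threads for i in eachindex(blks)
        factors[i] = lu(blks[i])   # each block is factored on its own thread
    end
    return factors
end
```

The block-wise `ldiv!` loop can be threaded the same way, since each right-hand-side segment belongs to exactly one block.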

@ChrisRackauckas
Member

yes indeed

@ctessum
Author

ctessum commented Feb 12, 2025

There is also this package: https://github.com/mipals/BlockDiagonalMatrices.jl

At the bottom of the README it says it is written in a way that makes parallelization easier.
