Clang itself is ~400k once you exclude things like the static analyzer and all the various tooling built on top of it that's shipped with clang. Either way, that number is sort of a drop in the bucket compared to LLVM. LLVM is millions of lines of code, very little of which could be dropped simply by only supporting C. Even a C-only programmer wants things like optimization passes and backends for every platform they plan on running on.