There's support in HuC for Pragma Fastcall, but no ever mentions or documents it. I've used it extensively. As you've probably already seen, the library is actually in all assembly. You can make your own assembly functions. Normally, when you make a C function in HuC, the compiler uses a software stack to pass arguments (and no, it doesn't modify the real stack - too small). Vars defined inside that function, also use a software stack (IIRC, it's been forever). You can get around it by using Global variables. But a much better method, once you get your code prototyped in C, is make an assembly version of it.
Fastcall allows you to bypass the internal stack and use either A/X/Y and dedicated Zeropage bytes, for passing arguments (as if they're PROCs in magickit, but they actually don't use the proc directive). You can even do argument overloading with fastcall; one function - multiple number of arguments. You can also define the size of ~each~ argument (byte, word, 24bit, 32bit). And lastly, fastcall also allows direct code placement - sort of. You can define a fastcall function that has no code. I.e. instead of passing the arguments to Acc/X/Y or zeropage, it passed the arguments directly to a port. I did this for arcade card reg writes.
You have to prototype your fastcall argument in C code, and then the actual code goes (usually) in the first library bank (because this bank is fixed, and is 'near' code). If you run out of room in the first Lib bank, remove some stuff, or put a 'far' code call in there (which is done for code in the second Lib bank).
As Old Rov said, simply just placing #asm/#endasm directives anyway will give you quick access to assembly code. Not that global variables defined in HuC can be accessed with the same name, just add an under score in front of it. Accessing internal function defines vars with #ASM, is more complicated.
Beware of pointer or array access in HuC. It's all treated as 'far' memory access, even static mapped ram, and code generated to access it is huge and slooow. Logic shifts are also pretty slow as well in HuC (go figure).
Anyway, that's my 2 cents worth