We haven't measured exact cycle counts, but you should be able to measure them yourself with the cycle counter hardware (on a Cortex-M7, the DWT cycle counter, DWT_CYCCNT) or a cycle-accurate simulator. Here are some sizes and timing measurements for an M7 core running at 216 MHz:
If the FPU is in use, each context switch takes additional cycles to save and restore the FPU registers. If you need MPU support, you'll have to use ThreadX Modules, and we do not have any benchmarks for Modules yet.
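
If you want to take the cycle-counter measurements yourself, here is a minimal sketch of one way to do it. It is not from ThreadX: the helper names (`elapsed_cycles`, `cycles_to_ns`, `cycle_counter_enable`, `cycle_counter_read`) are made up for illustration. The register addresses are the standard ARMv7-M DWT/DEMCR locations, and the hardware part is only compiled when building for an ARMv7E-M target. Some Cortex-M7 parts may also need the DWT lock register unlocked first, so check your device's reference manual.

```c
#include <assert.h>
#include <stdint.h>

/* Elapsed cycles between two counter snapshots. Unsigned subtraction
   stays correct across a single 32-bit wraparound of the counter. */
static uint32_t elapsed_cycles(uint32_t start, uint32_t end)
{
    return end - start;
}

/* Convert a cycle count to nanoseconds for a core clock given in Hz. */
static uint64_t cycles_to_ns(uint32_t cycles, uint32_t core_hz)
{
    assert(core_hz != 0u);
    return ((uint64_t)cycles * 1000000000ull) / core_hz;
}

#if defined(__ARM_ARCH_7EM__)
/* Standard ARMv7-M debug registers (assumed unlocked on this device). */
#define DEMCR      (*(volatile uint32_t *)0xE000EDFCu)
#define DWT_CTRL   (*(volatile uint32_t *)0xE0001000u)
#define DWT_CYCCNT (*(volatile uint32_t *)0xE0001004u)

/* Hypothetical helper: turn on the trace block and the cycle counter. */
static void cycle_counter_enable(void)
{
    DEMCR |= (1u << 24);   /* TRCENA: enable DWT/ITM */
    DWT_CYCCNT = 0u;       /* start counting from zero */
    DWT_CTRL |= 1u;        /* CYCCNTENA: enable the cycle counter */
}

/* Hypothetical helper: read the current cycle count. */
static uint32_t cycle_counter_read(void)
{
    return DWT_CYCCNT;
}
#endif
```

To use it, call `cycle_counter_enable()` once at startup, read the counter immediately before and after the operation you care about, and pass the two readings to `elapsed_cycles()`. At 216 MHz, 216 cycles corresponds to 1000 ns. Measure an empty interval as well and subtract it, since reading the counter has its own small overhead.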