There's an update on the oacs-5-1 branch that does targeted cache writes for individual nodes in site_node::new and site_node::mount, which is where most of the delay comes from.
I don't think we've seen the end of site-node-related scaling fixes, but this solves some important ones.
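
To illustrate why a targeted write helps, here is a rough Python sketch rather than the actual Tcl from the branch. It assumes the delay came from refreshing the whole cache whenever a node was added or mounted. The class SiteNodeCache, its methods, and the dict standing in for the site_nodes table are all hypothetical names made up for this example.

```python
class SiteNodeCache:
    """Toy in-memory cache keyed by URL (hypothetical, not the OpenACS code)."""

    def __init__(self, load_all, load_one):
        self._load_all = load_all  # callable returning {url: node} for every node
        self._load_one = load_one  # callable returning the node for a single url
        self._nodes = {}
        self.rows_read = 0         # counts how many rows each strategy touches

    def rebuild(self):
        # Full refresh: cost grows with the total number of site nodes.
        self._nodes = dict(self._load_all())
        self.rows_read += len(self._nodes)

    def update_node(self, url):
        # Targeted write: only the node that changed is fetched and stored.
        self._nodes[url] = self._load_one(url)
        self.rows_read += 1

    def get(self, url):
        return self._nodes[url]


# A dict standing in for the site_nodes table (assumption for the example).
db = {f"/pkg{i}/": {"node_id": i} for i in range(1000)}
cache = SiteNodeCache(lambda: db, lambda url: db[url])

cache.rebuild()                      # initial load reads every row once
before = cache.rows_read

db["/new/"] = {"node_id": 1000}      # a new node is created
cache.update_node("/new/")           # only that entry is written to the cache
targeted_cost = cache.rows_read - before
```

In this toy setup the targeted update touches one row no matter how many nodes exist, whereas calling rebuild() again would reread all of them.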