What we are doing now is manually splitting the model into separate chunks, meshing each smaller area on its own, and then combining the meshes afterwards. It's slow, annoying, and clunky, but it works.
There could be an option to split the bounding box into blocks that get meshed separately and combined at the end, batch style. That would probably be a lot easier to program than adding new APIs and changing the meshing algorithm. I also noticed that OpenCL 2.0 now has support for depth images, so maybe we could see some GPU acceleration for meshing?
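To make the block idea concrete, here is a rough sketch of just the splitting step in plain Python. The function name `split_bounding_box` and the `overlap` parameter are my own invention for illustration and not part of any existing API. The per-block meshing and the final merge are left out on purpose, since those depend on whatever the software exposes.

```python
from itertools import product


def split_bounding_box(bbox_min, bbox_max, divisions, overlap=0.0):
    """Split an axis-aligned box into a regular grid of sub-boxes.

    bbox_min, bbox_max: (x, y, z) corners of the full bounding box.
    divisions: number of blocks along each axis, e.g. (2, 2, 1).
    overlap: fraction of a block's size added on each side, clamped
             to the full box (hypothetical parameter, not from any tool).
    Returns a list of (block_min, block_max) tuples.
    """
    if any(n < 1 for n in divisions):
        raise ValueError("each axis needs at least one division")

    # Size of one block along each axis, before any overlap is added.
    sizes = [(hi - lo) / n for lo, hi, n in zip(bbox_min, bbox_max, divisions)]

    blocks = []
    for index in product(*(range(n) for n in divisions)):
        lo, hi = [], []
        for axis, i in enumerate(index):
            pad = sizes[axis] * overlap
            start = bbox_min[axis] + i * sizes[axis]
            # Grow the block by the padding, but never past the full box.
            lo.append(max(bbox_min[axis], start - pad))
            hi.append(min(bbox_max[axis], start + sizes[axis] + pad))
        blocks.append((tuple(lo), tuple(hi)))
    return blocks


if __name__ == "__main__":
    for block_min, block_max in split_bounding_box((0, 0, 0), (4, 2, 2), (2, 1, 1)):
        print(block_min, block_max)
```

Each returned block could then be meshed on its own and the results merged at the end, which is basically what we do by hand today. A small overlap may help avoid gaps where neighbouring blocks meet, but that is a guess and would need testing.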