2007, Brian D. Ripley, Pattern Recognition and Neural Networks, Cambridge University Press, →ISBN, page 166:
If we have a hyperprior for λ, the extension is quite easy: we can sample from this hyperprior, apply the procedure for each sample based on p(w; λ), and average over samples.