Candidate sampling for softmax neural networks in Mathematica

Although I have been using Mathematica for some years, I have never delved into its machine learning capabilities (except for the occasional clustering). Has anyone with experience using neural networks in Mathematica implemented candidate sampling for a softmax output layer? If so, how can it be done?