Glossary term
Multimodal AI
Long short-term memory (LSTM): a type of cell in a recurrent neural network (RNN) used to process sequences of data in applications such as handwriting recognition, machine translation, and image captioning. LSTMs address the vanishing gradient problem, which arises when RNNs are trained on long data sequences, by maintaining history in an internal memory state that is updated from new input and from the context passed along by previous cells in the RNN.
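The internal memory state described above can be illustrated with a minimal sketch of one common LSTM formulation (forget, input, and output gates plus a candidate value). This is a single-unit, pure-Python toy for illustration, not any particular library's implementation; the names `lstm_step`, `run_lstm`, and `params`, and the hand-picked weights, are assumptions introduced here.

```python
import math


def sigmoid(z):
    """Logistic function, squashes z into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def lstm_step(x, h_prev, c_prev, params):
    """One time step of a single-unit LSTM cell (scalar input and state).

    params maps each gate name to a tuple (w, u, b):
    input weight, recurrent weight, and bias (all hypothetical values).
    """
    def pre(name):
        w, u, b = params[name]
        return w * x + u * h_prev + b

    f = sigmoid(pre("forget"))       # how much of the old memory to keep
    i = sigmoid(pre("input"))        # how much new information to write
    o = sigmoid(pre("output"))       # how much memory to expose
    g = math.tanh(pre("candidate"))  # candidate value computed from the input

    # Additive memory update: old state scaled by f, plus gated new content.
    c = f * c_prev + i * g
    h = o * math.tanh(c)
    return h, c


def run_lstm(sequence, params):
    """Feed a sequence through the cell, carrying (h, c) between steps."""
    h, c = 0.0, 0.0
    for x in sequence:
        h, c = lstm_step(x, h, c, params)
    return h, c


# Hand-picked illustrative parameters (an assumption, not trained weights).
# A large forget-gate bias keeps f close to 1, so memory decays slowly.
params = {
    "forget": (0.0, 0.0, 4.0),
    "input": (0.0, 0.0, 0.0),
    "output": (0.0, 0.0, 0.0),
    "candidate": (1.0, 0.0, 0.0),
}

h_final, c_final = run_lstm([1.0, 0.0, 0.0, 0.0], params)
print(h_final, c_final)
```

In this toy setup, the signal written at the first step is still present in the cell state after three further steps of zero input, because the forget gate stays near 1. That additive carry-over is the mechanism the definition refers to as maintaining history in an internal memory state.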
Created for this library
A speech recognition vendor uses an LSTM in its acoustic model to capture temporal context across audio frames.
A demand forecasting team uses an LSTM on weekly sales sequences to capture seasonal context per SKU.
An NLP team uses an LSTM as a baseline on a small text classification task before evaluating transformers.
Definition source: Google for Developers Machine Learning Glossary | Creative Commons Attribution 4.0 License