What is CLS pooling?
I'm trying to understand a concept in machine learning called CLS pooling. Could someone explain what it is and how it works in simple terms? I'm particularly interested in its application in natural language processing or computer vision tasks.