{"id":6224,"date":"2025-12-11T10:44:31","date_gmt":"2025-12-11T10:44:31","guid":{"rendered":"https:\/\/sahelib.atatec-design.com\/index.php\/2025\/12\/11\/traitement-automatique-du-langage-pour-la-classification-darticles\/"},"modified":"2025-12-11T13:15:48","modified_gmt":"2025-12-11T13:15:48","slug":"traitement-automatique-du-langage-pour-la-classification-darticles","status":"publish","type":"post","link":"https:\/\/sahelib.atatec-design.com\/index.php\/2025\/12\/11\/traitement-automatique-du-langage-pour-la-classification-darticles\/","title":{"rendered":"Traitement automatique du langage pour la classification d&#8217;articles"},"content":{"rendered":"<h2>Traitement automatique du langage pour la classification d&#8217;articles<\/h2>\n<p><strong>Auteur(s) :<\/strong> Dr. Ali Diop \u2014 <strong>Date :<\/strong> 2020-05-05 \u2014 <strong>Source :<\/strong> arXiv<\/p>\n<h2 data-start=\"463\" data-end=\"481\"><strong data-start=\"466\" data-end=\"481\">R\u00e9sum\u00e9 (FR)<\/strong><\/h2>\n<p data-start=\"483\" data-end=\"1606\">Le traitement automatique du langage (TAL) constitue aujourd\u2019hui un outil fondamental pour organiser et analyser de vastes corpus textuels dans le domaine scientifique. La classification d\u2019articles, en particulier, permet d\u2019identifier automatiquement le domaine, le th\u00e8me et la pertinence des publications, facilitant ainsi la veille scientifique, la recherche documentaire et la recommandation d\u2019informations. Cet article pr\u00e9sente une revue compl\u00e8te des approches bas\u00e9es sur le TAL pour la classification d\u2019articles scientifiques. Nous abordons les m\u00e9thodes traditionnelles (TF-IDF, na\u00efve Bayes, SVM) ainsi que les techniques modernes bas\u00e9es sur l\u2019apprentissage profond (Word2Vec, BERT, Transformers). Une analyse comparative des performances des diff\u00e9rentes approches est propos\u00e9e, mettant en \u00e9vidence leurs avantages, limites et perspectives d\u2019int\u00e9gration dans des syst\u00e8mes de recommandation ou de gestion de biblioth\u00e8ques num\u00e9riques. Enfin, nous discutons des d\u00e9fis actuels et des opportunit\u00e9s offertes par les mod\u00e8les contextuels pour am\u00e9liorer la pr\u00e9cision et l\u2019efficacit\u00e9 de la classification automatique d\u2019articles.<\/p>\n<p data-start=\"1608\" data-end=\"1729\"><strong data-start=\"1608\" data-end=\"1623\">Mots-cl\u00e9s :<\/strong> Traitement automatique du langage, Classification d\u2019articles, Machine learning, Deep Learning, BERT, SVM.<\/p>\n<hr data-start=\"1731\" data-end=\"1734\" \/>\n<h2 data-start=\"1736\" data-end=\"1756\"><strong data-start=\"1739\" data-end=\"1756\">Abstract (EN)<\/strong><\/h2>\n<p data-start=\"1758\" data-end=\"2708\">Natural Language Processing (NLP) has become a fundamental tool for organizing and analyzing large text corpora in the scientific domain. Article classification, in particular, enables the automatic identification of the field, topic, and relevance of publications, thereby facilitating scientific monitoring, literature search, and information recommendation. This paper provides a comprehensive review of NLP-based approaches for scientific article classification. Traditional methods (TF-IDF, Naive Bayes, SVM) and modern deep learning techniques (Word2Vec, BERT, Transformers) are discussed. A comparative analysis of the performance of these approaches highlights their strengths, limitations, and integration potential in recommendation systems or digital library management. Finally, current challenges and the opportunities offered by contextual models to enhance the accuracy and efficiency of automatic article classification are discussed.<\/p>\n<p data-start=\"2710\" data-end=\"2820\"><strong data-start=\"2710\" data-end=\"2723\">Keywords:<\/strong> Natural Language Processing, Article Classification, Machine Learning, Deep Learning, BERT, SVM.<\/p>\n<hr data-start=\"2822\" data-end=\"2825\" \/>\n<h2 data-start=\"2827\" data-end=\"2849\"><strong data-start=\"2830\" data-end=\"2849\">1. Introduction<\/strong><\/h2>\n<p data-start=\"2851\" data-end=\"3183\">Avec la croissance exponentielle de la production scientifique, la gestion et l\u2019analyse des publications deviennent un d\u00e9fi majeur pour les chercheurs, biblioth\u00e9caires et institutions acad\u00e9miques. Chaque ann\u00e9e, des millions d\u2019articles sont publi\u00e9s dans diff\u00e9rents domaines, rendant la recherche manuelle fastidieuse et inefficace.<\/p>\n<p data-start=\"3185\" data-end=\"3488\">Le <strong data-start=\"3188\" data-end=\"3231\">traitement automatique du langage (TAL)<\/strong>, combin\u00e9 aux m\u00e9thodes d\u2019apprentissage automatique et d\u2019apprentissage profond, permet de <strong data-start=\"3320\" data-end=\"3377\">classifier automatiquement les articles scientifiques<\/strong> selon leur domaine, leurs mots-cl\u00e9s, leur pertinence ou leur impact potentiel. Cette classification facilite :<\/p>\n<ul data-start=\"3490\" data-end=\"3730\">\n<li data-start=\"3490\" data-end=\"3524\">\n<p data-start=\"3492\" data-end=\"3524\">La veille scientifique cibl\u00e9e.<\/p>\n<\/li>\n<li data-start=\"3525\" data-end=\"3569\">\n<p data-start=\"3527\" data-end=\"3569\">La recommandation d\u2019articles pertinents.<\/p>\n<\/li>\n<li data-start=\"3570\" data-end=\"3652\">\n<p data-start=\"3572\" data-end=\"3652\">L\u2019organisation de bases de donn\u00e9es acad\u00e9miques et de biblioth\u00e8ques num\u00e9riques.<\/p>\n<\/li>\n<li data-start=\"3653\" data-end=\"3730\">\n<p data-start=\"3655\" data-end=\"3730\">L\u2019extraction d\u2019informations pour les revues syst\u00e9matiques et m\u00e9ta-analyses.<\/p>\n<\/li>\n<\/ul>\n<p data-start=\"3732\" data-end=\"3914\">Cet article propose une <strong data-start=\"3756\" data-end=\"3843\">revue syst\u00e9matique des m\u00e9thodes classiques et modernes de classification d\u2019articles<\/strong>, en mettant l\u2019accent sur leurs performances, avantages et limitations.<\/p>\n<hr data-start=\"3916\" data-end=\"3919\" \/>\n<h2 data-start=\"3921\" data-end=\"3944\"><strong data-start=\"3924\" data-end=\"3944\">2. \u00c9tat de l\u2019art<\/strong><\/h2>\n<h3 data-start=\"3946\" data-end=\"3982\"><strong data-start=\"3950\" data-end=\"3982\">2.1 M\u00e9thodes traditionnelles<\/strong><\/h3>\n<ol data-start=\"3984\" data-end=\"4755\">\n<li data-start=\"3984\" data-end=\"4222\">\n<p data-start=\"3987\" data-end=\"4021\"><strong data-start=\"3987\" data-end=\"4019\">Bag of Words (BoW) et TF-IDF<\/strong><\/p>\n<ul data-start=\"4025\" data-end=\"4222\">\n<li data-start=\"4025\" data-end=\"4104\">\n<p data-start=\"4027\" data-end=\"4104\">Repr\u00e9sentation vectorielle des documents bas\u00e9e sur la fr\u00e9quence des termes.<\/p>\n<\/li>\n<li data-start=\"4108\" data-end=\"4153\">\n<p data-start=\"4110\" data-end=\"4153\">Avantages : simplicit\u00e9, interpr\u00e9tabilit\u00e9.<\/p>\n<\/li>\n<li data-start=\"4157\" data-end=\"4222\">\n<p data-start=\"4159\" data-end=\"4222\">Limites : ne capture pas le contexte ou la s\u00e9mantique des mots.<\/p>\n<\/li>\n<\/ul>\n<\/li>\n<li data-start=\"4224\" data-end=\"4542\">\n<p data-start=\"4227\" data-end=\"4273\"><strong data-start=\"4227\" data-end=\"4271\">Algorithmes de classification supervis\u00e9e<\/strong><\/p>\n<ul data-start=\"4277\" data-end=\"4542\">\n<li data-start=\"4277\" data-end=\"4344\">\n<p data-start=\"4279\" data-end=\"4344\"><strong data-start=\"4279\" data-end=\"4294\">Na\u00efve Bayes<\/strong> : probabiliste, efficace pour de grands corpus.<\/p>\n<\/li>\n<li data-start=\"4348\" data-end=\"4447\">\n<p data-start=\"4350\" data-end=\"4447\"><strong data-start=\"4350\" data-end=\"4383\">Support Vector Machines (SVM)<\/strong> : performantes pour des donn\u00e9es textuelles \u00e0 haute dimension.<\/p>\n<\/li>\n<li data-start=\"4451\" data-end=\"4542\">\n<p data-start=\"4453\" data-end=\"4542\"><strong data-start=\"4453\" data-end=\"4483\">k-Nearest Neighbors (k-NN)<\/strong> : simple mais moins scalable pour des corpus volumineux.<\/p>\n<\/li>\n<\/ul>\n<\/li>\n<li data-start=\"4544\" data-end=\"4755\">\n<p data-start=\"4547\" data-end=\"4593\"><strong data-start=\"4547\" data-end=\"4591\">Limitations des m\u00e9thodes traditionnelles<\/strong><\/p>\n<ul data-start=\"4597\" data-end=\"4755\">\n<li data-start=\"4597\" data-end=\"4639\">\n<p data-start=\"4599\" data-end=\"4639\">D\u00e9pendance \u00e0 la qualit\u00e9 des mots-cl\u00e9s.<\/p>\n<\/li>\n<li data-start=\"4643\" data-end=\"4695\">\n<p data-start=\"4645\" data-end=\"4695\">Difficult\u00e9 \u00e0 g\u00e9rer la polys\u00e9mie et la synonymie.<\/p>\n<\/li>\n<li data-start=\"4699\" data-end=\"4755\">\n<p data-start=\"4701\" data-end=\"4755\">N\u00e9cessit\u00e9 de pr\u00e9traitement et de normalisation lourds.<\/p>\n<\/li>\n<\/ul>\n<\/li>\n<\/ol>\n<hr data-start=\"4757\" data-end=\"4760\" \/>\n<h3 data-start=\"4762\" data-end=\"4820\"><strong data-start=\"4766\" data-end=\"4820\">2.2 Approches modernes bas\u00e9es sur le Deep Learning<\/strong><\/h3>\n<ol data-start=\"4822\" data-end=\"5542\">\n<li data-start=\"4822\" data-end=\"5034\">\n<p data-start=\"4825\" data-end=\"4846\"><strong data-start=\"4825\" data-end=\"4844\">Word Embeddings<\/strong><\/p>\n<ul data-start=\"4850\" data-end=\"5034\">\n<li data-start=\"4850\" data-end=\"4948\">\n<p data-start=\"4852\" data-end=\"4948\"><strong data-start=\"4852\" data-end=\"4871\">Word2Vec, GloVe<\/strong> : repr\u00e9sentent les mots par des vecteurs continus capturant la s\u00e9mantique.<\/p>\n<\/li>\n<li data-start=\"4952\" data-end=\"5034\">\n<p data-start=\"4954\" data-end=\"5034\">Permettent une meilleure g\u00e9n\u00e9ralisation et une classification plus contextuelle.<\/p>\n<\/li>\n<\/ul>\n<\/li>\n<li data-start=\"5036\" data-end=\"5330\">\n<p data-start=\"5039\" data-end=\"5080\"><strong data-start=\"5039\" data-end=\"5078\">Mod\u00e8les contextuels et Transformers<\/strong><\/p>\n<ul data-start=\"5084\" data-end=\"5330\">\n<li data-start=\"5084\" data-end=\"5167\">\n<p data-start=\"5086\" data-end=\"5167\"><strong data-start=\"5086\" data-end=\"5108\">BERT, RoBERTa, GPT<\/strong> : capturent le contexte complet d\u2019un mot dans la phrase.<\/p>\n<\/li>\n<li data-start=\"5171\" data-end=\"5243\">\n<p data-start=\"5173\" data-end=\"5243\">Performances sup\u00e9rieures sur les t\u00e2ches de classification textuelle.<\/p>\n<\/li>\n<li data-start=\"5247\" data-end=\"5330\">\n<p data-start=\"5249\" data-end=\"5330\">Permettent le fine-tuning sur des corpus sp\u00e9cifiques pour am\u00e9liorer la pr\u00e9cision.<\/p>\n<\/li>\n<\/ul>\n<\/li>\n<li data-start=\"5332\" data-end=\"5542\">\n<p data-start=\"5335\" data-end=\"5388\"><strong data-start=\"5335\" data-end=\"5386\">R\u00e9seaux de neurones r\u00e9currents (RNN, LSTM, GRU)<\/strong><\/p>\n<ul data-start=\"5392\" data-end=\"5542\">\n<li data-start=\"5392\" data-end=\"5451\">\n<p data-start=\"5394\" data-end=\"5451\">Exploitent les d\u00e9pendances s\u00e9quentielles dans le texte.<\/p>\n<\/li>\n<li data-start=\"5455\" data-end=\"5542\">\n<p data-start=\"5457\" data-end=\"5542\">Limitations : difficult\u00e9 \u00e0 g\u00e9rer de tr\u00e8s longs documents sans m\u00e9canismes d\u2019attention.<\/p>\n<\/li>\n<\/ul>\n<\/li>\n<\/ol>\n<hr data-start=\"5544\" data-end=\"5547\" \/>\n<h3 data-start=\"5549\" data-end=\"5595\"><strong data-start=\"5553\" data-end=\"5595\">2.3 Revue comparative des performances<\/strong><\/h3>\n<div class=\"TyagGW_tableContainer\">\n<div class=\"group TyagGW_tableWrapper flex w-fit flex-col-reverse\" tabindex=\"-1\">\n<table class=\"w-fit min-w-(--thread-content-width)\" data-start=\"5597\" data-end=\"6246\">\n<thead data-start=\"5597\" data-end=\"5654\">\n<tr data-start=\"5597\" data-end=\"5654\">\n<th data-start=\"5597\" data-end=\"5607\" data-col-size=\"sm\">M\u00e9thode<\/th>\n<th data-start=\"5607\" data-end=\"5619\" data-col-size=\"md\">Avantages<\/th>\n<th data-start=\"5619\" data-end=\"5629\" data-col-size=\"md\">Limites<\/th>\n<th data-start=\"5629\" data-end=\"5654\" data-col-size=\"md\">Applications typiques<\/th>\n<\/tr>\n<\/thead>\n<tbody data-start=\"5711\" data-end=\"6246\">\n<tr data-start=\"5711\" data-end=\"5815\">\n<td data-start=\"5711\" data-end=\"5726\" data-col-size=\"sm\">TF-IDF + SVM<\/td>\n<td data-start=\"5726\" data-end=\"5750\" data-col-size=\"md\">Rapide, interpr\u00e9table<\/td>\n<td data-start=\"5750\" data-end=\"5779\" data-col-size=\"md\">Ne capture pas le contexte<\/td>\n<td data-start=\"5779\" data-end=\"5815\" data-col-size=\"md\">Classification th\u00e9matique simple<\/td>\n<\/tr>\n<tr data-start=\"5816\" data-end=\"5974\">\n<td data-start=\"5816\" data-end=\"5830\" data-col-size=\"sm\">Na\u00efve Bayes<\/td>\n<td data-start=\"5830\" data-end=\"5863\" data-col-size=\"md\">Efficace sur corpus volumineux<\/td>\n<td data-start=\"5863\" data-end=\"5925\" data-col-size=\"md\">Hypoth\u00e8se d\u2019ind\u00e9pendance, faible pr\u00e9cision sur textes longs<\/td>\n<td data-start=\"5925\" data-end=\"5974\" data-col-size=\"md\">Classification de spam, cat\u00e9gorisation rapide<\/td>\n<\/tr>\n<tr data-start=\"5975\" data-end=\"6088\">\n<td data-start=\"5975\" data-end=\"5991\" data-col-size=\"sm\">Word2Vec + NN<\/td>\n<td data-start=\"5991\" data-end=\"6025\" data-col-size=\"md\">Capture la s\u00e9mantique, flexible<\/td>\n<td data-start=\"6025\" data-end=\"6052\" data-col-size=\"md\">Besoin d\u2019un grand corpus<\/td>\n<td data-start=\"6052\" data-end=\"6088\" data-col-size=\"md\">Analyse th\u00e9matique et clustering<\/td>\n<\/tr>\n<tr data-start=\"6089\" data-end=\"6246\">\n<td data-start=\"6089\" data-end=\"6111\" data-col-size=\"sm\">BERT \/ Transformers<\/td>\n<td data-start=\"6111\" data-end=\"6156\" data-col-size=\"md\">Pr\u00e9cision \u00e9lev\u00e9e, capture contexte complet<\/td>\n<td data-start=\"6156\" data-end=\"6185\" data-col-size=\"md\">Tr\u00e8s co\u00fbteux en ressources<\/td>\n<td data-start=\"6185\" data-end=\"6246\" data-col-size=\"md\">Classification fine, recommandation, extraction d\u2019entit\u00e9s<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/div>\n<\/div>\n<hr data-start=\"6248\" data-end=\"6251\" \/>\n<h2 data-start=\"6253\" data-end=\"6275\"><strong data-start=\"6256\" data-end=\"6275\">3. M\u00e9thodologie<\/strong><\/h2>\n<ol data-start=\"6277\" data-end=\"6761\">\n<li data-start=\"6277\" data-end=\"6377\">\n<p data-start=\"6280\" data-end=\"6377\"><strong data-start=\"6280\" data-end=\"6297\">Pr\u00e9traitement<\/strong> : nettoyage du texte, tokenisation, suppression des stopwords, lemmatisation.<\/p>\n<\/li>\n<li data-start=\"6378\" data-end=\"6466\">\n<p data-start=\"6381\" data-end=\"6466\"><strong data-start=\"6381\" data-end=\"6398\">Vectorisation<\/strong> : TF-IDF pour m\u00e9thodes classiques, embeddings pour deep learning.<\/p>\n<\/li>\n<li data-start=\"6467\" data-end=\"6552\">\n<p data-start=\"6470\" data-end=\"6552\"><strong data-start=\"6470\" data-end=\"6486\">Entra\u00eenement<\/strong> : mod\u00e8les supervis\u00e9s (SVM, Na\u00efve Bayes) ou fine-tuning de BERT.<\/p>\n<\/li>\n<li data-start=\"6553\" data-end=\"6648\">\n<p data-start=\"6556\" data-end=\"6648\"><strong data-start=\"6556\" data-end=\"6570\">\u00c9valuation<\/strong> : pr\u00e9cision, rappel, F1-score, AUC pour mesurer la performance des mod\u00e8les.<\/p>\n<\/li>\n<li data-start=\"6649\" data-end=\"6761\">\n<p data-start=\"6652\" data-end=\"6761\"><strong data-start=\"6652\" data-end=\"6675\">Analyse comparative<\/strong> : benchmarking sur des corpus scientifiques existants (arXiv, PubMed, ScienceDirect).<\/p>\n<\/li>\n<\/ol>\n<hr data-start=\"6763\" data-end=\"6766\" \/>\n<h2 data-start=\"6768\" data-end=\"6804\"><strong data-start=\"6771\" data-end=\"6804\">4. Discussion et perspectives<\/strong><\/h2>\n<ul data-start=\"6806\" data-end=\"7453\">\n<li data-start=\"6806\" data-end=\"6958\">\n<p data-start=\"6808\" data-end=\"6958\">Les <strong data-start=\"6812\" data-end=\"6835\">mod\u00e8les contextuels<\/strong> (BERT et variantes) surpassent syst\u00e9matiquement les m\u00e9thodes classiques pour la classification d\u2019articles scientifiques.<\/p>\n<\/li>\n<li data-start=\"6959\" data-end=\"7140\">\n<p data-start=\"6961\" data-end=\"7140\">Les <strong data-start=\"6965\" data-end=\"6986\">m\u00e9thodes hybrides<\/strong>, combinant TF-IDF avec embeddings ou BERT avec techniques de filtrage collaboratif, offrent des solutions robustes pour des syst\u00e8mes de recommandation.<\/p>\n<\/li>\n<li data-start=\"7141\" data-end=\"7279\">\n<p data-start=\"7143\" data-end=\"7279\">Les d\u00e9fis restent la <strong data-start=\"7164\" data-end=\"7179\">scalabilit\u00e9<\/strong>, la <strong data-start=\"7184\" data-end=\"7232\">compr\u00e9hension des longs textes scientifiques<\/strong> et la <strong data-start=\"7239\" data-end=\"7276\">mise \u00e0 jour dynamique des mod\u00e8les<\/strong>.<\/p>\n<\/li>\n<li data-start=\"7280\" data-end=\"7453\">\n<p data-start=\"7282\" data-end=\"7453\">L\u2019avenir s\u2019oriente vers <strong data-start=\"7306\" data-end=\"7336\">l\u2019int\u00e9gration multi-omique<\/strong>, le TAL pour le <strong data-start=\"7353\" data-end=\"7375\">r\u00e9sum\u00e9 automatique<\/strong> et la <strong data-start=\"7382\" data-end=\"7412\">classification multi-label<\/strong> dans des bases de donn\u00e9es scientifiques.<\/p>\n<\/li>\n<\/ul>\n<hr data-start=\"7455\" data-end=\"7458\" \/>\n<h2 data-start=\"7460\" data-end=\"7480\"><strong data-start=\"7463\" data-end=\"7480\">5. Conclusion<\/strong><\/h2>\n<p data-start=\"7482\" data-end=\"7965\">Le traitement automatique du langage permet aujourd\u2019hui de classifier efficacement des articles scientifiques, am\u00e9liorant l\u2019acc\u00e8s \u00e0 l\u2019information et la veille scientifique. Les m\u00e9thodes modernes bas\u00e9es sur le deep learning et les mod\u00e8les contextuels repr\u00e9sentent un progr\u00e8s significatif par rapport aux approches traditionnelles. N\u00e9anmoins, des d\u00e9fis subsistent concernant la gestion de corpus volumineux, la complexit\u00e9 computationnelle et la mise \u00e0 jour dynamique des connaissances.<\/p>\n<hr data-start=\"7967\" data-end=\"7970\" \/>\n<h2 data-start=\"7972\" data-end=\"8006\"><strong data-start=\"7975\" data-end=\"8006\">6. R\u00e9f\u00e9rences scientifiques<\/strong><\/h2>\n<ol data-start=\"8008\" data-end=\"9100\">\n<li data-start=\"8008\" data-end=\"8163\">\n<p data-start=\"8011\" data-end=\"8163\">Mikolov, T., Chen, K., Corrado, G., &amp; Dean, J. (2013). <em data-start=\"8066\" data-end=\"8128\">Efficient estimation of word representations in vector space<\/em>. arXiv preprint arXiv:1301.3781.<\/p>\n<\/li>\n<li data-start=\"8164\" data-end=\"8322\">\n<p data-start=\"8167\" data-end=\"8322\">Devlin, J., Chang, M.-W., Lee, K., &amp; Toutanova, K. (2019). <em data-start=\"8226\" data-end=\"8308\">BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding<\/em>. NAACL-HLT.<\/p>\n<\/li>\n<li data-start=\"8323\" data-end=\"8479\">\n<p data-start=\"8326\" data-end=\"8479\">Joachims, T. (1998). <em data-start=\"8347\" data-end=\"8435\">Text categorization with Support Vector Machines: Learning with many relevant features<\/em>. European Conference on Machine Learning.<\/p>\n<\/li>\n<li data-start=\"8480\" data-end=\"8596\">\n<p data-start=\"8483\" data-end=\"8596\">Sebastiani, F. (2002). <em data-start=\"8506\" data-end=\"8557\">Machine learning in automated text categorization<\/em>. ACM Computing Surveys, 34(1), 1\u201347.<\/p>\n<\/li>\n<li data-start=\"8597\" data-end=\"8685\">\n<p data-start=\"8600\" data-end=\"8685\">Kim, Y. (2014). <em data-start=\"8616\" data-end=\"8675\">Convolutional Neural Networks for Sentence Classification<\/em>. EMNLP.<\/p>\n<\/li>\n<li data-start=\"8686\" data-end=\"8848\">\n<p data-start=\"8689\" data-end=\"8848\">Zhang, Y., &amp; Wallace, B. (2017). <em data-start=\"8722\" data-end=\"8837\">A Sensitivity Analysis of (and Practitioners\u2019 Guide to) Convolutional Neural Networks for Sentence Classification<\/em>. IJCNLP.<\/p>\n<\/li>\n<li data-start=\"8849\" data-end=\"8972\">\n<p data-start=\"8852\" data-end=\"8972\">Aggarwal, C. C., &amp; Zhai, C. (2012). <em data-start=\"8888\" data-end=\"8932\">A survey of text classification algorithms<\/em>. Mining Text Data, Springer, 163\u2013222.<\/p>\n<\/li>\n<li data-start=\"8973\" data-end=\"9100\">\n<p data-start=\"8976\" data-end=\"9100\">Araci, D. (2019). <em data-start=\"8994\" data-end=\"9066\">FinBERT: Financial Sentiment Analysis with Pre-trained Language Models<\/em>. arXiv preprint arXiv:1908.10063.<\/p>\n<\/li>\n<\/ol>\n<hr data-start=\"9102\" data-end=\"9105\" \/>\n<p data-start=\"9107\" data-end=\"9349\">\n","protected":false},"excerpt":{"rendered":"<p>Traitement automatique du langage pour la classification d&#8217;articles Auteur(s) : Dr. Ali Diop \u2014 Date : 2020-05-05 \u2014 Source : arXiv R\u00e9sum\u00e9 (FR) Le traitement automatique du langage (TAL) constitue aujourd\u2019hui un outil fondamental pour organiser et analyser de vastes corpus textuels dans le domaine scientifique. La classification d\u2019articles, en particulier, permet d\u2019identifier automatiquement le [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":6371,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_bbp_topic_count":0,"_bbp_reply_count":0,"_bbp_total_topic_count":0,"_bbp_total_reply_count":0,"_bbp_voice_count":0,"_bbp_anonymous_reply_count":0,"_bbp_topic_count_hidden":0,"_bbp_reply_count_hidden":0,"_bbp_forum_subforum_count":0,"footnotes":""},"categories":[108],"tags":[],"class_list":["post-6224","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-informatique-intelligence-artificielle"],"acf":[],"_links":{"self":[{"href":"https:\/\/sahelib.atatec-design.com\/index.php\/wp-json\/wp\/v2\/posts\/6224","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sahelib.atatec-design.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/sahelib.atatec-design.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/sahelib.atatec-design.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/sahelib.atatec-design.com\/index.php\/wp-json\/wp\/v2\/comments?post=6224"}],"version-history":[{"count":1,"href":"https:\/\/sahelib.atatec-design.com\/index.php\/wp-json\/wp\/v2\/posts\/6224\/revisions"}],"predecessor-version":[{"id":6373,"href":"https:\/\/sahelib.atatec-design.com\/index.php\/wp-json\/wp\/v2\/posts\/6224\/revisions\/6373"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/sahelib.atatec-design.com\/index.php\/wp-json\/wp\/v2\/media\/6371"}],"wp:attachment":[{"href":"https:\/\/sahelib.atatec-design.com\/index.php\/wp-json\/wp\/v2\/media?parent=6224"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/sahelib.atatec-design.com\/index.php\/wp-json\/wp\/v2\/categories?post=6224"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/sahelib.atatec-design.com\/index.php\/wp-json\/wp\/v2\/tags?post=6224"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}