Natural Language Processing for Text in Python

Authors

  • Karshiyev Abduvali Berkinovich Professor of Samarkand branch of Tashkent University of information technologies named after Muhammad al-Khwarizmi, Samarkand, Uzbekistan
  • Mamaraimov Mirjalol Shakarboyivich A Master of Samarkand branch of Tashkent University of information technologies named after Muhammad al-Khwarizmi, Samarkand, Uzbekistan

Keywords:

pipeline, NLP, spaCy, Python, part-of-speech, lemmatization

Abstract

This article covers the basic principles of natural language processing (NLP) technologies and how they can be used in the Python programming language. Processes such as tokenization, stemming, and lemmatization are consistently described. Accordingly, methods of text tokenization using spaCy library tools and using lemma, POS, tag, stop attributes created through the pipeline process are provided.

Downloads

Published

2024-06-24