<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article
PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.4 20190208//EN"
       "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" article-type="research-article" dtd-version="1.4" xml:lang="en">
 <front>
  <journal-meta>
   <journal-id journal-id-type="publisher-id">Automation and modeling in design and management</journal-id>
   <journal-title-group>
    <journal-title xml:lang="en">Automation and modeling in design and management</journal-title>
    <trans-title-group xml:lang="ru">
     <trans-title>Автоматизация и моделирование в проектировании и управлении</trans-title>
    </trans-title-group>
   </journal-title-group>
   <issn publication-format="print">2658-3488</issn>
   <issn publication-format="online">2658-6436</issn>
  </journal-meta>
  <article-meta>
   <article-id pub-id-type="publisher-id">126244</article-id>
   <article-id pub-id-type="doi">10.30987/2658-6436-2026-2-27-33</article-id>
   <article-categories>
    <subj-group subj-group-type="toc-heading" xml:lang="ru">
     <subject>МАТЕМАТИЧЕСКОЕ И КОМПЬЮТЕРНОЕ МОДЕЛИРОВАНИЕ</subject>
    </subj-group>
    <subj-group subj-group-type="toc-heading" xml:lang="en">
     <subject>MATHEMATICAL AND COMPUTER MODELING</subject>
    </subj-group>
    <subj-group>
     <subject>МАТЕМАТИЧЕСКОЕ И КОМПЬЮТЕРНОЕ МОДЕЛИРОВАНИЕ</subject>
    </subj-group>
   </article-categories>
   <title-group>
    <article-title xml:lang="en">DEVELOPING A METHODOLOGY FOR CLASSIFYING COMMITS  IN GIT-REPOSITORIES USING MACHINE LEARNING</article-title>
    <trans-title-group xml:lang="ru">
     <trans-title>РАЗРАБОТКА МЕТОДОЛОГИИ КЛАССИФИКАЦИИ КОММИТОВ  В GIT-РЕПОЗИТОРИЯХ С ИСПОЛЬЗОВАНИЕМ МАШИННОГО ОБУЧЕНИЯ</trans-title>
    </trans-title-group>
   </title-group>
   <contrib-group content-type="authors">
    <contrib contrib-type="author">
     <contrib-id contrib-id-type="orcid">https://orcid.org/0000-0003-4095-7029</contrib-id>
     <name-alternatives>
      <name xml:lang="ru">
       <surname>Копелиович</surname>
       <given-names>Дмитрий Игоревич</given-names>
      </name>
      <name xml:lang="en">
       <surname>Kopeliovich</surname>
       <given-names>Dmitriy Igorevich</given-names>
      </name>
     </name-alternatives>
     <email>dkopeliovich@rambler.ru</email>
     <bio xml:lang="ru">
      <p>кандидат технических наук;</p>
     </bio>
     <bio xml:lang="en">
      <p>candidate of technical sciences;</p>
     </bio>
     <xref ref-type="aff" rid="aff-1"/>
    </contrib>
    <contrib contrib-type="author">
     <contrib-id contrib-id-type="orcid">https://orcid.org/0009-0008-1270-8918</contrib-id>
     <name-alternatives>
      <name xml:lang="ru">
       <surname>Кургуз</surname>
       <given-names>Михаил Андреевич</given-names>
      </name>
      <name xml:lang="en">
       <surname>Kurguz</surname>
       <given-names>Mihail Andreevich</given-names>
      </name>
     </name-alternatives>
     <email>mihail.kurguz2016@yandex.ru</email>
     <xref ref-type="aff" rid="aff-2"/>
    </contrib>
   </contrib-group>
   <aff-alternatives id="aff-1">
    <aff>
     <institution xml:lang="ru">Брянский государственный технический университет</institution>
     <country>RU</country>
    </aff>
    <aff>
     <institution xml:lang="en">Bryansk State Technical University</institution>
     <country>RU</country>
    </aff>
   </aff-alternatives>
   <aff-alternatives id="aff-2">
    <aff>
     <institution xml:lang="ru">Брянский государственный технический университет</institution>
    </aff>
    <aff>
     <institution xml:lang="en">Bryansk State Technical University</institution>
    </aff>
   </aff-alternatives>
   <pub-date publication-format="print" date-type="pub" iso-8601-date="2026-04-20T00:00:00+03:00">
    <day>20</day>
    <month>04</month>
    <year>2026</year>
   </pub-date>
   <pub-date publication-format="electronic" date-type="pub" iso-8601-date="2026-04-20T00:00:00+03:00">
    <day>20</day>
    <month>04</month>
    <year>2026</year>
   </pub-date>
   <volume>2026</volume>
   <issue>2</issue>
   <fpage>27</fpage>
   <lpage>33</lpage>
   <history>
    <date date-type="received" iso-8601-date="2026-03-24T00:00:00+03:00">
     <day>24</day>
     <month>03</month>
     <year>2026</year>
    </date>
    <date date-type="accepted" iso-8601-date="2026-04-15T00:00:00+03:00">
     <day>15</day>
     <month>04</month>
     <year>2026</year>
    </date>
   </history>
   <self-uri xlink:href="https://zh-szf.ru/en/nauka/article/126244/view">https://zh-szf.ru/en/nauka/article/126244/view</self-uri>
   <abstract xml:lang="ru">
    <p>Представлена разработка методологии и программного обеспечения для автоматической классификации коммитов в Git-репозиториях с использованием методов машинного обучения. Предложенный подход сочетает в себе текстовую векторизацию на основе TF-IDF и модель Multinominal Naive Bayes для классификации коммитов по категориям. Подход включает в себя систему активного обучения, которая дает пользователю возможность корректировать предлагаемые классификации, что способствует непрерывному совершенствованию модели. Методология включает предварительную обработку описаний коммитов, извлечение семантических признаков и построение адаптивной классификационной модели. Результаты работы могут быть использованы для повышения прозрачности процессов разработки, анализа историй изменений, анализа и оптимизации кода и автоматизации процессов тестирования и доставки новых модулей разрабатываемого проекта заинтересованным сторонам (CI/CD).</p>
   </abstract>
   <trans-abstract xml:lang="en">
    <p>This paper presents the development of a methodology and software for automatically classifying commits in Git-repositories using machine learning methods. The proposed approach combines text vectorization based on TF-IDF and the Multinomial Naive Bayes model for classifying commits into categories. The approach includes an active learning system that allows the user to adjust the proposed classifications, facilitating continuous model improvement. The methodology includes preprocessing commit descriptions, extracting semantic features, and building an adaptive classification model. The results of this work can be used to improve the transparency of development processes, to analyze change histories, to analyze and optimize code, and to automate testing and delivery of new modules of the project being developed to stakeholders (Continuous Integration / Continuous Delivery).</p>
   </trans-abstract>
   <kwd-group xml:lang="ru">
    <kwd>машинное обучение</kwd>
    <kwd>Git</kwd>
    <kwd>классификации коммитов</kwd>
    <kwd>активное обучение</kwd>
    <kwd>TF-IDF</kwd>
    <kwd>Multinominal Naive Bayes</kwd>
   </kwd-group>
   <kwd-group xml:lang="en">
    <kwd>machine learning</kwd>
    <kwd>Git</kwd>
    <kwd>commit classification</kwd>
    <kwd>active learning</kwd>
    <kwd>TF-IDF</kwd>
    <kwd>Multinomial Naive Bayes</kwd>
   </kwd-group>
  </article-meta>
 </front>
 <body>
  <p></p>
 </body>
 <back>
  <ref-list>
   <ref id="B1">
    <label>1.</label>
    <citation-alternatives>
     <mixed-citation xml:lang="ru">Крупкин С.А. Работа с системой контроля версий Git [Текст]. – М.: Изд-во Московского университета, 2022. – 120 с.</mixed-citation>
     <mixed-citation xml:lang="en">Krupkin SA. Working with the Git Version Control System. Moscow: Moscow University Press; 2022.</mixed-citation>
    </citation-alternatives>
   </ref>
   <ref id="B2">
    <label>2.</label>
    <citation-alternatives>
     <mixed-citation xml:lang="ru">Automated commit classification for git repositories using machine learning technique [Текст] / X. Wang, Y. Jiang, Y. Xu et al. // Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2023). – 2023. – P. 112-124.</mixed-citation>
     <mixed-citation xml:lang="en">Wang X, Jiang Y, Xu Y, et al. Automated Commit Classification for Git Repositories Using Machine Learning Technique. In: Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Soft-ware Engineering (ESEC/FSE 2023): 2023. p. 112-124.</mixed-citation>
    </citation-alternatives>
   </ref>
   <ref id="B3">
    <label>3.</label>
    <citation-alternatives>
     <mixed-citation xml:lang="ru">Конвенция коммитов (Conventional Commits) [Электронный ресурс] / Conventional Commits Initiative. – Version 1.0.0, 2019.</mixed-citation>
     <mixed-citation xml:lang="en">Conventional Commits. Conventional Commits Initiative. Version 1.0.0 [Internet]. 2019.</mixed-citation>
    </citation-alternatives>
   </ref>
   <ref id="B4">
    <label>4.</label>
    <citation-alternatives>
     <mixed-citation xml:lang="ru">Иванов Н.Н. Синтаксический разбор предложения для векторизации текста // Вопросы науки и образования. – 2017. – №11 (12). – С. 45-46.</mixed-citation>
     <mixed-citation xml:lang="en">Ivanov N.N. Syntactic Parsing of a Sentence for Text Vectorization. Problems of Science and Education. 2017;11:45-46.</mixed-citation>
    </citation-alternatives>
   </ref>
   <ref id="B5">
    <label>5.</label>
    <citation-alternatives>
     <mixed-citation xml:lang="ru">Zhang H., Jiang L., Yu H.-K. A literature review on naive Bayes classifiers // Intelligent Data Analysis. – 2020. – Vol. 24. – No. 1. – P. 37-57.</mixed-citation>
     <mixed-citation xml:lang="en">Zhang H., Jiang L., Yu H-K. A Literature Review on Naive Bayes Classifiers. Intelligent Data Analysis. 2020;24(1):37-57.</mixed-citation>
    </citation-alternatives>
   </ref>
   <ref id="B6">
    <label>6.</label>
    <citation-alternatives>
     <mixed-citation xml:lang="ru">Гусев П.Ю. Обработка текстов и подготовка моделей векторизации для программного комплекса классификации научных текстов // Моделирование, оптимизация и информационные технологии. – 2021. – №9 (1).</mixed-citation>
     <mixed-citation xml:lang="en">Gusev PY. Word Processing and Preparation of Vectorization Models for a Software Package for the Classification of Scientific Texts. Modelling, Optimization and Information Technology. 2021;9(1).</mixed-citation>
    </citation-alternatives>
   </ref>
   <ref id="B7">
    <label>7.</label>
    <citation-alternatives>
     <mixed-citation xml:lang="ru">Terentyeva Yu. Sentiment Analysis, InSet Lexicon, SentiStrength Lexicon, Naive Bayes, Multinomial Naive Bayes, TF-IDF, Machine Learning // International Journal of Open Information Technologies. – 2024. – №7. URL: https://cyberleninka.ru/article/n/ sentiment-analysis-inset-lexicon-sentistrength-lexicon-naive-bayes-multinomial-naive-bayes-tf-idf-machine-learning (дата обращения: 10.01.2026).</mixed-citation>
     <mixed-citation xml:lang="en">Terentyeva Yu. Sentiment Analysis, InSet Lexicon, SentiStrength Lexicon, Naive Bayes, Multinomial Naive Bayes, TF-IDF, Machine Learning. International Journal of Open Information Technologies [Internet]. 2024 [cited 2026 Jan 10];7. Available from: https://cyberleninka.ru/article/n/sentiment-analysis-inset-lexicon-sentistrength-lexicon-naive-bayes-multinomial-naive-bayes-tf-idf-machine-learning</mixed-citation>
    </citation-alternatives>
   </ref>
   <ref id="B8">
    <label>8.</label>
    <citation-alternatives>
     <mixed-citation xml:lang="ru">Pascarella L. On the Use of Machine Learning Techniques for Software Engineering Tasks: A Systematic Literature Review // IEEE Transactions on Software Engineering. – 2021. – Т. 47, № 11. – С. 2301-2325.</mixed-citation>
     <mixed-citation xml:lang="en">Pascarella L. On the Use of Machine Learning Techniques for Software Engineering Tasks: A Systematic Literature Review. IEEE Transactions on Software Engineering. 2021;47(11):2301-2325.</mixed-citation>
    </citation-alternatives>
   </ref>
   <ref id="B9">
    <label>9.</label>
    <citation-alternatives>
     <mixed-citation xml:lang="ru">Zhang Y., Wang H., Liu Z. A Comparative Study of Text Classification Algorithms // Journal of Machine Learning Research. – 2018. – Vol. 19. – P. 1-35.</mixed-citation>
     <mixed-citation xml:lang="en">Zhang Y., Wang H., Liu Z. A Comparative Study of Text Classification Algorithms. Journal of Machine Learning Research. 2018;19:1-35.</mixed-citation>
    </citation-alternatives>
   </ref>
   <ref id="B10">
    <label>10.</label>
    <citation-alternatives>
     <mixed-citation xml:lang="ru">Chen M., Li X., Zhou J. Scalable Text Classification: A Benchmark // Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL). – 2020. – P. 4567-4579.</mixed-citation>
     <mixed-citation xml:lang="en">Chen M, Li X, Zhou J. Scalable Text Classification: A Benchmark. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics (ACL): 2020. p. 4567-4579.</mixed-citation>
    </citation-alternatives>
   </ref>
   <ref id="B11">
    <label>11.</label>
    <citation-alternatives>
     <mixed-citation xml:lang="ru">Wang T., Jiang L., Chen R. Noise-Robust Text Classification with Naive Bayes // Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). – 2019. – P. 1234-1243.</mixed-citation>
     <mixed-citation xml:lang="en">Wang T, Jiang L, Chen R. Noise-Robust Text Classification with Naive Bayes. In: Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP): 2019. p. 1234-1243.</mixed-citation>
    </citation-alternatives>
   </ref>
  </ref-list>
 </back>
</article>
