Processing of textual data

Processing of textual data

1.

Subject title

Processing of textual data

Обработка на текстуални податоци

2.

Code

m23_w_045

3.

Study program

IT management, Bioinformatics, Security, Cryptography and Coding, Cloud Computing, Еducation with ICT, Eco-informatics, Inteligent Systems, Internet Technologies and cyber security, Computer Science, Statistics and Data Analytics, Software for embedded systems, Software Engineering, Cloud Computing, IT management, Bioinformatics, Security, Cryptography and Coding, Statistics and Data Analytics, Software Engineering, Data science in computer science and engineering,

4.

Organizer of the study program (unit, institute, department, division)

Faculty of Information Sciences and Computer Engineering

5.

Study cycle (first, second, third)

Втор циклус

6.

Academic year / semester

5 / Зимски

7. Number of ECTS credits

6.0

8.

Instructor

проф. д-р Катерина Здравкова

9.

Prerequisites for enrollment

10.

Subject goals and competencies:


Upon successful completion of this subject, the student will be able to Recognize key problems arising from using textual Data, get to know the most appropriate algorithms for their dismissal, to Explore and analyze text data and to discover new knowledge, to know and apply existing text data processing tools and to create own solutions for the realization of certain tasks in the field of processing and mining of text documents and data.

11.

Subject content:


Introduction to the processing of text data and textual mining ;; extraction, classification, categorization and grouping of data ;; Basic Techniques of Text Mining ;; Text Mining and Machine learning ;; Examples of Application of Text Mining Techniques different applications ;; future trends in processing text data.

12.

Learning methods:


предавања, проекти, дискусии, работилници

13.

Total available time fund

6.0 ECTS x 30 hours = 180 hours

14.

Time distribution

30 + 45 + 0 + 0 + 0 = 180 hours

15.

Forms of teaching activities

15.1.

Lectures - theoretical teaching

30 hours

15.2.

Exercises (laboratory, classroom), seminars, team work

45 hours

16.

Other forms of activities

16.1.

Project tasks

0 hours

16.2.

Independent tasks

0 hours

16.3.

Homework

0 hours

17.

Grading method

17.1.

Tests

0 points

17.2.

Seminar work / project (presentation: written and oral)

0 points

17.3.

Activities and learning

0 points

17.4.

Final exam

0 points

18.

Grading criteria (points / grade)

up to 50 points

5 (five) (F)

from 51 to 60 points

6 (six) (E)

from 61 to 70 points

7 (seven) (D)

from 71 to 80 points

8 (eight) (C)

from 81 to 90 points

9 (nine) (B)

from 91 to 100 points

10 (ten) (A)

19.

Condition for signature and taking final exam

Реализирани активности 15, 16

20.

Language of instruction

македонски или англиски

21.

Quality assurance method

-

22.

Literature

22.1.

Mandatory literature

No.

Author

Title

Publisher

Year

6581

Dipanjan Sarkar

Text Analytics with Python: A Practical Real-World Approach to Gaining Actionable Insights from your Data

Apress

2016

6582

Rob Miller

Text Processing with Ruby: Extract Value from the Data That Surrounds You

Pragmatic Bookshelf

2015

6583

Gabe Ignatow,‎ Rada F. Mihalcea

Text Mining: A Guidebook For The Social Sciences

SAGE Publications

2016

22.2.

Additional literature

No.

Author

Title

Publisher

Year