Page 2 of 10
303
Kang, N. (2023). The South Korea Halloween Crush in CNN News: A NetMiner Analysis. Advances in Social Sciences Research Journal, 10(1). 302-311.
URL: http://dx.doi.org/10.14738/assrj.101.13796
that topic 2 was the most occurred one in 40 pieces of CNN news, followed by topic 7, topic 6,
and topic 1. We also contend that the probability for the word Seoul to be the first keyword is
the highest (0.136). In section 2.4, we argue that the word person was the most occurred one in
40 pieces of CNN news, followed by the word crowd, the word Seoul, the word police, the word
Itaewon, and the word crush, in that order. Finally, we show that the keywords accident, call,
official, response, report, and government are directly linked to the word emergency. These links
seem to show that people in danger requested for rescue.
RESULTS
Frequency of Words and Their Proportion
In what follows, we aim at examining the frequency of nouns occurred in 40 pieces of CNN news
and their proportion. Table 1 shows the frequency of words used in the CNN news, their
proportion, and their cumulative proportion:
Table 1 Frequency of nouns
Value Frequency Proportion Cumulative Proportion
1.0 443 0.394 0.394
2.0 210 0.187 0.58
3.0 109 0.097 0.677
4.0 70 0.062 0.74
5.0 52 0.046 0.786
6.0 29 0.026 0.812
7.0 29 0.026 0.837
8.0 19 0.017 0.854
9.0 19 0.017 0.871
10.0 15 0.013 0.884
11.0 6 0.005 0.89
12.0 10 0.009 0.899
13.0 10 0.009 0.908
14.0 8 0.007 0.915
15.0 15 0.013 0.928
16.0 8 0.007 0.935
17.0 8 0.007 0.942
18.0 4 0.004 0.946
19.0 3 0.003 0.948
20.0 3 0.003 0.951
21.0 3 0.003 0.954
22.0 1 0.001 0.955
23.0 3 0.003 0.957
24.0 2 0.002 0.959
25.0 1 0.001 0.96
26.0 2 0.002 0.962
27.0 2 0.002 0.964
29.0 1 0.001 0.964
30.0 3 0.003 0.967
31.0 4 0.004 0.971
32.0 2 0.002 0.972
Page 3 of 10
304
Advances in Social Sciences Research Journal (ASSRJ) Vol. 10, Issue 1, January-2023
Services for Science and Education – United Kingdom
34.0 2 0.002 0.974
36.0 2 0.002 0.976
39.0 3 0.003 0.979
43.0 1 0.001 0.98
45.0 1 0.001 0.98
46.0 1 0.001 0.981
49.0 2 0.002 0.983
50.0 1 0.001 0.984
52.0 2 0.002 0.986
53.0 1 0.001 0.987
54.0 1 0.001 0.988
55.0 2 0.002 0.989
59.0 1 0.001 0.99
62.0 2 0.002 0.992
63.0 1 0.001 0.993
87.0 1 0.001 0.994
100.0 1 0.001 0.995
102.0 1 0.001 0.996
122.0 1 0.001 0.996
151.0 1 0.001 0.997
178.0 1 0.001 0.998
187.0 1 0.001 0.999
265.0 1 0.001 1
Total 1125 1
It is probably worthwhile pointing out that one word has the highest frequency (443 tokens)
and the highest proportion (0.394) in 40 pieces of CNN news. It is worth mentioning, on the
other hand, that the frequency of two words used in the CNN news is 210 tokens (the second
highest). It should be noted that their proportion is 0.187 and their cumulative proportion is
0.58. I think it must be pointed out that there are three words used in the CNN news and that
their proportion and their cumulative proportion are 0.097 and 0.677, respectively.
Note that their frequency is 109 tokens and it ranks third (the third highest). Additionally, there
are four words whose frequency is 70 tokens. Notice that there are five words that often
occurred in the CNN news and their frequency is 52 tokens (the fifth highest). As exemplified
in Table 1, the overall frequency of all words occurred in the CNN news is 1,125 tokens and
their proportion is 1 (100%). We thus conclude that one word has the highest frequency (443
tokens) and the highest proportion (0.394) in the CNN news.
Word Cloud
In the following, we provide word cloud in which the relevant keywords are represented in
different sizes. Table 2 shows word cloud that is related to 40 pieces of CNN news: