view memo/result.txt @ 43:724c29abb137

add result
author Masataka Kohagura <kohagura@cr.ie.u-ryukyu.ac.jp>
date Wed, 10 Feb 2016 17:34:17 +0900
parents 8576011b8447
children cfaafa209424

Wed Feb 10 11:06:12 JST 2016

./cerium/ceriumGrep -regex '[A-Z][a-zA-Z0-9_]*' -file file/500MB.txt >   25.29s user 0.53s system 100% cpu 25.721 total

[with cache]
cpu time
 1  25.721
 2  15.518
 3  12.260
 4  11.165
 5   9.659
 6   9.183
 7   8.551
 8   8.180
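
The scaling in the table above is easier to read as speedup and parallel efficiency relative to the 1-CPU run. The following is a minimal sketch; the names `cached_times` and `speedup_table` are introduced here for illustration and are not part of ceriumGrep. The times are copied from the table.

```python
# Wall-clock times in seconds, copied from the [with cache] ceriumGrep table.
cached_times = {1: 25.721, 2: 15.518, 3: 12.260, 4: 11.165,
                5: 9.659, 6: 9.183, 7: 8.551, 8: 8.180}


def speedup_table(times):
    """Return {cpus: (speedup, efficiency)} relative to the 1-CPU time."""
    base = times[1]
    return {n: (base / t, base / t / n) for n, t in sorted(times.items())}


if __name__ == "__main__":
    for n, (speedup, efficiency) in speedup_table(cached_times).items():
        print(f"{n:2d}  speedup {speedup:5.2f}  efficiency {efficiency:5.2f}")
```

With these numbers the 8-CPU run is roughly 3.1 times faster than the 1-CPU run, so each added CPU contributes progressively less.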

[no cache : bread]
 1  30.682
 2  19.841
 3  17.822
 4  16.497
 5  16.491
 6  14.145
 7  16.375
 8  15.907

[no cache : mmap]
 1  35.783
 2  22.758
 3  19.466
 4  17.189
 5  15.653
 6  16.141
 7  15.388
 8  15.901

[no cache]
./regexParser -subset -regex '[A-Z][a-zA-Z0-9_]*' -ts -file file/500MB.txt >   16.06s user 0.22s system 77% cpu 21.139 total

[with cache]
./regexParser -subset -regex '[A-Z][a-zA-Z0-9_]*' -ts -file file/500MB.txt >   16.05s user 0.19s system 99% cpu 16.246 total

cgrep -G '[A-Z][a-zA-Z0-9_]*' file/500MB.txt --no-line-umber --no-filename >/dev/null
Could not measure (left it running for about 2 hours)

egrep -o '(a|b)*a(a|b)(a|b)(a|b)(a|b)(a|b)(a|b)(a|b)' file/ab500MB.txt >   113.08s user 0.21s system 99% cpu 1:53.29 total
line:12503552

egrep -o '(a|b)*a(a|b)(a|b)(a|b)(a|b)(a|b)(a|b)' file/ab500MB.txt > /dev/null  103.32s user 0.18s system 99% cpu 1:43.50 total
line:14066496

egrep -o '(a|b)*a(a|b)(a|b)(a|b)(a|b)(a|b)' file/ab500MB.txt > /dev/null  98.29s user 0.18s system 99% cpu 1:38.47 total
line:15629440

egrep -o '(a|b)*a(a|b)(a|b)(a|b)(a|b)' file/ab500MB.txt > /dev/null  94.72s user 0.18s system 99% cpu 1:34.89 total
line:16410912

egrep -o '(a|b)*a(a|b)(a|b)(a|b)' file/ab500MB.txt > /dev/null  90.15s user 0.19s system 99% cpu 1:30.33 total
line:16410912

egrep -o '(a|b)*a(a|b)(a|b)' file/ab500MB.txt > /dev/null  82.88s user 0.20s system 99% cpu 1:23.09 total
line:19536800

egrep -o '[A-Z][a-zA-Z0-9_]*' 500MB.txt > /dev/null
56.34s user 0.16s system 99% cpu 56.506 total
line:13260580

After sudo purge (cache cleared)
egrep -o '[A-Z][a-zA-Z0-9_]*' 500MB.txt > /dev/null
57.37s user 0.22s system 98% cpu 58.382 total

The time is the same with or without the cache.
(Is it reading the file on every run while it greps?)
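
One way to check this is to compare CPU time (user + system) with the wall-clock total in the two egrep timing lines above. If the ratio is close to 1 in both runs, the run is CPU-bound and file I/O can hardly affect the total. The sketch below assumes the zsh-style `time` output format used in this memo; `TIME_RE`, `to_seconds` and `cpu_fraction` are names introduced here for illustration.

```python
import re

# Matches lines such as "56.34s user 0.16s system 99% cpu 56.506 total".
TIME_RE = re.compile(
    r"(?P<user>[\d.]+)s user\s+(?P<system>[\d.]+)s system\s+"
    r"(?P<cpu>\d+)% cpu\s+(?P<total>[\d:.]+) total"
)


def to_seconds(clock):
    """Convert 'SS.ss', 'M:SS.ss' or 'H:MM:SS.ss' to seconds."""
    seconds = 0.0
    for part in clock.split(":"):
        seconds = seconds * 60 + float(part)
    return seconds


def cpu_fraction(line):
    """Return (user + system) / total for one timing line."""
    m = TIME_RE.search(line)
    if m is None:
        raise ValueError(f"not a timing line: {line!r}")
    busy = float(m["user"]) + float(m["system"])
    return busy / to_seconds(m["total"])


if __name__ == "__main__":
    cached = "56.34s user 0.16s system 99% cpu 56.506 total"
    purged = "57.37s user 0.22s system 98% cpu 58.382 total"
    print(f"cached: {cpu_fraction(cached):.3f}")
    print(f"purged: {cpu_fraction(purged):.3f}")
```

Both lines give a fraction above 0.98, which is consistent with egrep spending nearly all of its time computing rather than waiting for the disk.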

-------------------------------------------

Mon Feb  8 17:24:16 JST 2016
compare cgrep egrep ceriumgrep seqsearch
500MB.txt '[A-Z][a-zA-Z0-9_]*'
ab500MB.txt '(a|b)*a(a|b)(a|b)'



time cgrep -G '[A-Z][a-zA-Z0-9_]*' ../../Game/Cerium/example/bm_search/1GB.txt --no-line-umber --no-filename >/dev/null

[word count]
    firefly
    cpu time
     1  7.408101
     2  3.800094
     3  2.593649
     4  1.982035
     5  1.609130
     6  1.356986
     7  1.171626
     8  1.038483
     9  0.931845
    10  0.851650
    11  0.783369
    12  0.741725
    13  0.729744
    14  0.721221
    15  0.706474
    16  0.694984
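
The word count times above flatten out past about 12 CPUs. The Karp-Flatt metric gives a rough estimate of the serial fraction implied by a measured speedup. This is a sketch only; `karp_flatt` is a name introduced here, and the times are copied from the table.

```python
# Selected wall-clock times in seconds from the word count table above.
word_count_times = {1: 7.408101, 2: 3.800094, 4: 1.982035,
                    8: 1.038483, 16: 0.694984}


def karp_flatt(t1, tp, p):
    """Experimentally determined serial fraction for p > 1 processors."""
    if p <= 1:
        raise ValueError("p must be greater than 1")
    speedup = t1 / tp
    return (1 / speedup - 1 / p) / (1 - 1 / p)


if __name__ == "__main__":
    t1 = word_count_times[1]
    for p, tp in sorted(word_count_times.items()):
        if p > 1:
            print(f"{p:2d}  serial fraction {karp_flatt(t1, tp, p):.4f}")
```

If the estimated fraction rises as CPUs are added, that suggests overhead that grows with the CPU count rather than a fixed serial part.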

    [mmap]
     1 19.124272
     4 17.701034
     8 17.517347
    16 16.844748

    [bread]
     1 15.219672
     4 15.892460
     8 13.709429
    16 13.913612

----------------------------------------
How to clear the cache
%sudo purge
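
`purge` is the macOS command. On a Linux host the usual equivalent is to run `sync` and then write `3` to `/proc/sys/vm/drop_caches` as root. The sketch below only builds the command for a given platform and does not run it. `drop_cache_command` is a hypothetical helper, not something used in the measurements in this memo.

```python
import platform


def drop_cache_command(system=None):
    """Return the argv that clears the OS file cache on the given platform."""
    system = system or platform.system()
    if system == "Darwin":
        return ["sudo", "purge"]
    if system == "Linux":
        # Flush dirty pages first, then drop page cache, dentries and inodes.
        return ["sudo", "sh", "-c", "sync; echo 3 > /proc/sys/vm/drop_caches"]
    raise NotImplementedError(f"no cache-clearing command known for {system}")


if __name__ == "__main__":
    print(" ".join(drop_cache_command()))
```

The returned list can be passed to `subprocess.run` before each uncached measurement.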

./cerium/ceriumGrep  -regex '[A-Z][A-Za-z]*' -file ../../../Game/Cerium/example/bm_search/1GB.txt -cpu 16 -br

[firefly]
    [with cache : file read time excluded]
    firefly
    | CPU | time |
     1   85.171171
     2   55.709298
     3   48.688031
     4   42.053209
     5   40.690125
     6   37.075352
     7   34.771558
     8   36.138412
     9   33.190304
    10   35.892051
    11   33.734864
    12   31.231748
    13   32.997263
    14   31.953924
    15   31.359396
    16   31.367073

    [no cache : file read time included]
    [mmap]
     1   96.669395
     4   47.382920
     8   40.574622
    16   41.616542

    [bread]
     1   84.327310
     4   44.930445
     8   43.237358
    16   42.504598

    egrep -o

    [no cache]
    egrep -o '[A-Z][a-zA-Z0-9_]*' ../../../Game/Cerium/example/bm_search/1GB.txt
    110.78s user 24.05s system 99% cpu 2:15.22 total

    [with cache]
    egrep -o '[A-Z][a-zA-Z0-9_]*' ../../../Game/Cerium/example/bm_search/1GB.txt
    111.36s user 24.28s system 99% cpu 2:16.33 total



    ./cerium/ceriumGrep  -regex '(a|b)*a(a|b)(a|b)' -file ../../../Game/Cerium/example/bm_search/1GB.txt -cpu 8
    cpu time
     1  58.409044
     2  30.587006
     3  19.761497
     4  15.099642
     5  12.150340
     6  10.202328
     7   8.794964
     8   7.791925
     9   6.884088
    10   6.195592
    11   5.702492
    12   5.412080
    13   5.330420
    14   5.247614
    15   5.165163
    16   5.115427

    [mmap]
     1  70.830896
     4  23.777594
     8  16.743966
    16  15.853613

    [bread]
     1  58.259406
     4  19.307748
     8  17.217379
    16  15.243179

    egrep -o '(a|b)*a(a|b)(a|b)' ../../../Game/Cerium/example/bm_search/1GB.txt
    106.43s user 0.32s system 99% cpu 1:46.75 total

----------------------------------------------------------------
    firefly
    Try increasing the number of (a|b) terms
    ./cerium/ceriumGrep -subset -regex '(a|b)*a(a|b)[...]' -file file/ab1GB.txt -cpu 8

    regex : (a|b)*a(a|b)

    cpu time
     8  130.188505

    regex : (a|b)*a(a|b)(a|b)
    cpu time
     8  113.549269

    regex : (a|b)*a(a|b)(a|b)(a|b)
     8  114.059856

    regex : (a|b)*a(a|b)(a|b)(a|b)(a|b)
     8  115.274656
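
The pattern family `(a|b)*a(a|b){n}` is a standard example where subset construction (NFA to DFA) produces a number of DFA states that grows exponentially in n. The sketch below counts the reachable DFA states for a hand-built NFA. It assumes that the `-subset` flag refers to subset construction, which this memo does not state, and `dfa_state_count` is a name introduced here, not ceriumGrep code.

```python
def dfa_state_count(n):
    """Count DFA states reachable by subset construction for (a|b)*a(a|b){n}.

    NFA states: 0 loops on a and b and also moves to 1 on 'a';
    states 1..n move to the next state on a or b; state n + 1 accepts.
    """
    def step(states, symbol):
        nxt = set()
        for s in states:
            if s == 0:
                nxt.add(0)
                if symbol == "a":
                    nxt.add(1)
            elif s <= n:
                nxt.add(s + 1)
        return frozenset(nxt)

    start = frozenset({0})
    seen = {start}
    todo = [start]
    while todo:
        current = todo.pop()
        for symbol in "ab":
            nxt = step(current, symbol)
            if nxt not in seen:
                seen.add(nxt)
                todo.append(nxt)
    return len(seen)


if __name__ == "__main__":
    for n in range(1, 8):
        print(n, dfa_state_count(n))
```

For the patterns measured here (one to four trailing terms) the automaton stays small, so a large change in run time would not be expected from the state count alone.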

    egrep -o '(a|b)*a(a|b)(a|b)' file/ab1GB.txt
    223.31s user 40.86s system 99% cpu 4:24.17 total

    egrep -o '(a|b)*a(a|b)(a|b)(a|b)' file/ab1GB.txt
    240.23s user 35.46s system 99% cpu 4:35.70 total

    egrep -o '(a|b)*a(a|b)(a|b)(a|b)(a|b)' file/ab1GB.txt
    252.63s user 35.24s system 99% cpu 4:47.87 total

    Hypothesis: the Print output is so large that most of the measured time may be spent in Print
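
One way to test this kind of hypothesis is to time the same search twice, once only counting matches and once also writing every match to an output sink. The sketch below does this in plain Python on synthetic text with the `[A-Z][a-zA-Z0-9_]*` pattern from this memo. It is not ceriumGrep and says nothing about its Print task; all function names are introduced here for illustration.

```python
import io
import re
import time

PATTERN = re.compile(r"[A-Z][a-zA-Z0-9_]*")


def count_only(text):
    """Count matches without producing any output."""
    return sum(1 for _ in PATTERN.finditer(text))


def count_and_print(text, out):
    """Count matches and write each one to `out`, one per line."""
    n = 0
    for m in PATTERN.finditer(text):
        out.write(m.group(0) + "\n")
        n += 1
    return n


def timed(fn, *args):
    """Run fn(*args) and return (result, elapsed seconds)."""
    t0 = time.perf_counter()
    result = fn(*args)
    return result, time.perf_counter() - t0


if __name__ == "__main__":
    text = "Foo bar Baz_1 qux\n" * 200_000  # synthetic input, not 500MB.txt
    n1, t_count = timed(count_only, text)
    n2, t_print = timed(count_and_print, text, io.StringIO())
    print(f"matches {n1} / {n2}")
    print(f"count only {t_count:.3f}s, with output {t_print:.3f}s")
```

The difference between the two times approximates the cost of producing the output.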


    CentOS 7.2 os.cr.ie.u-ryukyu.ac.jp