Why query same data faster when open db than After compaction #98

cmumford · 2014-09-09T17:31:42Z

Original issue 92 created by david-zf@163.com on 2012-05-26T02:32:37.000Z:

What steps will reproduce the problem?
1. Put 100,000,000 records in the db; the db size is 2.8 GB.
2. After 2-3 minutes, once the number of files has stopped changing (compaction should be complete), querying 90 records scattered across the db takes 12 s. After closing and reopening the db, the same query takes 47 ms.

What is the expected output? What do you see instead?
The two query times should be the same, or differ only slightly.

What version of the product are you using? On what operating system?
LevelDB 1.4 Windows 7 64bit

Please provide any additional information below.

cmumford · 2014-09-09T17:31:42Z

Comment #1 originally posted by dhruba on 2012-05-26T04:53:28.000Z:

What is the value of numopenfiles in your test case? By default, leveldb keeps at most 1000 files open. Also, did you change the size of the block cache?

There are two possibilities:

1. When the db is not restarted, the data is cached in the leveldb block cache. Once you restart the server, its cache is cold and it takes longer to query the same data.
2. leveldb does buffered IO, so data could also be cached in the OS block cache. It is possible that when you restart the db process, it closes the files that were mmapped, which tells the kernel that the corresponding OS pages can be purged. It might therefore take longer to read the same data after you restart the db process.
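The first possibility (a warm block cache versus a cold one after a restart) can be illustrated with a toy model. This is only a sketch under stated assumptions: `ToyBlockCache` and `CountHits` are hypothetical names invented for this example, the cache tracks block numbers rather than block contents, and none of it is leveldb's actual cache implementation.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <list>
#include <unordered_map>
#include <vector>

// Hypothetical toy LRU cache keyed by block number. It only tracks which
// blocks are resident; a real block cache would also hold the block data.
class ToyBlockCache {
 public:
  explicit ToyBlockCache(std::size_t capacity) : capacity_(capacity) {
    assert(capacity_ > 0);
  }

  // Returns true on a hit. On a miss the block is inserted (standing in for
  // a read from disk) and the least recently used block is evicted if full.
  bool Access(std::uint64_t block) {
    auto it = index_.find(block);
    if (it != index_.end()) {
      lru_.splice(lru_.begin(), lru_, it->second);  // mark most recently used
      return true;
    }
    if (lru_.size() == capacity_) {
      index_.erase(lru_.back());
      lru_.pop_back();
    }
    lru_.push_front(block);
    index_[block] = lru_.begin();
    return false;
  }

  // Drops every cached block, as a process restart would in this model.
  void Clear() {
    lru_.clear();
    index_.clear();
  }

 private:
  std::size_t capacity_;
  std::list<std::uint64_t> lru_;  // front = most recently used
  std::unordered_map<std::uint64_t, std::list<std::uint64_t>::iterator> index_;
};

// Counts how many of the given block reads are served from the cache.
std::size_t CountHits(ToyBlockCache& cache,
                      const std::vector<std::uint64_t>& blocks) {
  std::size_t hits = 0;
  for (std::uint64_t b : blocks) {
    if (cache.Access(b)) ++hits;
  }
  return hits;
}
```

In this model, running the same query twice against one cache yields misses on the first pass and hits on the second, while calling `Clear()` between passes makes the second pass miss again, which is the cold-cache effect described above.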

cmumford · 2016-01-15T23:46:03Z

Old question - assuming answered.

cmumford self-assigned this Sep 9, 2014

cmumford added Type-Defect labels Sep 9, 2014

cmumford added question and removed Priority-Medium labels Jan 15, 2016

cmumford closed this as completed Jan 15, 2016

