Split cold and warm index writes by rallen090 · Pull Request #3618 · m3db/m3

rallen090 · 2021-07-22T16:24:28Z

No description provided.

…rotations

…locks for evicting series from in-mem segments

rallen090 · 2021-07-22T18:22:50Z

+	// the active block. We ensure that can't happen by always writing the cold index in-mem data
+	// directly.
+	warmBatch := index.NewWriteBatch(batch.Options())
+	coldBatch := index.NewWriteBatch(batch.Options())


This is the simplest implementation for splitting. Assuming this approach works I can fix this up to avoid needless allocs (e.g. if no cold writes, just use the original batch instead of newing two more, etc.)

Since there's only ever a single goroutine at a time ever calling writeBatchForBlockStart(...) can we actually just use a lock (like a new "writeBatchesLock") and then always reuse the same warm batches and cold batches struct and reset at the start of each write batch call? (not sure if it's reuseable but should technically be trivial to make reuseable).

rallen090 · 2021-07-22T18:25:43Z

 // NB: users are expected to use `NewEntry` to construct these objects.
 type Entry struct {
 	relookupAndIncrementReaderWriterCount func() (index.OnIndexSeries, bool)
+	queryableBlockRetriever               series.QueryableBlockRetriever


Note @robskillington the way I'm exposing the way to get "block states" which we check against for RequiresColdFlushForBlockStart, is by having the Entry have a pointer to the dbShard which implements this series.QueryableBlockRetriever type. Let me know of any thoughts here.

robskillington · 2021-07-22T18:28:47Z

+	// Don't pass stats back from insertion into a cold block,
+	// we only care about warm mutable segments stats.


Hm, can't we make that decision in the caller not to use the stats? Or does the "Combine()" method do it automatically? Could we make it a "CombineWithoutStats()" so that at least this doesn't respond with just an empty datastructure that we don't really know should be empty or not from the viewpoint of the caller?

codecov · 2021-07-22T18:35:31Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 46.0%. Comparing base (1032c31) to head (25894ae).
⚠️ Report is 487 commits behind head on master.

❗ There is a different number of reports uploaded between BASE (1032c31) and HEAD (25894ae). Click for more details.

HEAD has 88 uploads less than BASE

Flag BASE (1032c31) HEAD (25894ae)

collector 16 4

aggregator 16 4

m3em 16 4

cluster 16 4

msg 16 4

metrics 16 4

dbnode 16 0

Additional details and impacted files

@@            Coverage Diff            @@
##           master   #3618      +/-   ##
=========================================
- Coverage    57.3%   46.0%   -11.4%     
=========================================
  Files         551     213     -338     
  Lines       64021   19078   -44943     
=========================================
- Hits        36712    8784   -27928     
+ Misses      24115    9608   -14507     
+ Partials     3194     686    -2508

Flag	Coverage Δ
aggregator	`57.2% <ø> (ø)`
cluster	`∅ <ø> (∅)`
collector	`58.4% <ø> (ø)`
dbnode	`?`
m3em	`46.4% <ø> (ø)`
metrics	`19.8% <ø> (ø)`
msg	`74.4% <ø> (+<0.1%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Continue to review full report in Codecov by Harness.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 1032c31...25894ae. Read the comment docs.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

robskillington · 2021-07-22T18:42:01Z

+	if !ok {
+		return false
+	}
+	coldBlocks := entry.Series.ColdFlushBlockStarts(v)


This method unfortunately doesn't quite do what I think you might think it does, it only returns cold blocks that "need to be flushed":
https://github.com/m3db/m3/blob/2e48cd9759e228a92114eaf2b20d28e7f1ee19aa/src/dbnode/storage/series/buffer.go#L436:L445

I would go and implement a new method on src/dbnode/storage/series/series.go which calls src/dbnode/storage/series/buffer.go and returns True if a block start has a cold bucket that's not empty.

i.e. something like:

func (b *dbBuffer) ColdWritesAtBlockStartExist(blockStart xtime.UnixNano) bool { bucketVersions, ok := b.bucketsMap[blockStart] if !ok { return false } for _, bucket := range bucketVersions.buckets { if bucket.writeType == ColdWrite && bucket.streamsLen() > 0 { return true } } return false }

…check has on disk segment (#3622)

…dex-active-block-cold-write-batch

…ck result not aggregated results) (#3631)

…dex-active-block-cold-write-batch

robskillington and others added 30 commits May 18, 2021 06:57

[dbnode] Use active block which GCs series instead of explicit block …

7d21f7d

…rotations

Fix supplying RelookupAndIncrementReaderWriterCount

fde5188

Always write to active block

2fe5644

Set attempt to true when calling OnIndexPrepare

2b167e7

Do not expire from the shard until no longer indexed any block starts

10cd227

Check query range against block state

8270d5c

Remove print

951b420

Allow no metadata

5e2e22d

Fix test

d23c8fc

Only consider blocks both sealed and in-mem data evicted as flushed b…

f1222d5

…locks for evicting series from in-mem segments

Rebased

f29148d

Fix unit test 1

341c8ef

Fix unit test 2

3bdfc8d

Add debug info to integration test

33194dc

Fix integration test

b9a0f32

Fix integration test for rotation

612da98

More unit test fixes

7623cd4

Big unit test fixes

0869a68

More unit test fixes

12045e2

More unit test fixes 2

2d37a0f

More unit test fixes 3

3064b1c

More unit test fixes

e7dc197

Fix test TestNamespaceIndexTick

b288ef0

Fix test TestLimits

7fd4eb9

Fix test TestNamespaceForwardIndexInsertQuery

bbe332a

Fix test TestNamespaceIndexForwardWrite

b9930a1

Fix test TestNamespaceIndexInsertQuery

5d9aee4

Fix test TestNamespaceIndexInsertQuery

75768fe

Fix test TestShardAsyncInsertMarkIndexedForBlockStart

aecdc56

Fix test race condition

9f4ff04

rallen090 added 15 commits July 14, 2021 17:17

Merge remote-tracking branch 'origin/master' into r/index-active-block

8564260

Rebased

f723d6f

Rebased CI

6878e08

Gen

9294049

Gen

ed89e86

Add metric

837131e

Merge remote-tracking branch 'origin/master' into r/index-active-block

1a5362c

Expose RequiresColdFlushForBlockStart on series

9a71208

Split write batch into warm and cold 1

1bef7ce

Split write batch into warm and cold 2

4051ec0

Cleanup

7f81d72

Fix tests

82f4d1c

Fix more tests

7b7b845

Fix more tests 2

932996c

Lint

8e1c02d

rallen090 commented Jul 22, 2021

View reviewed changes

Lint

ac43320

rallen090 commented Jul 22, 2021

View reviewed changes

robskillington reviewed Jul 22, 2021

View reviewed changes

rallen090 and others added 8 commits July 22, 2021 16:53

Feedback 1

32fac77

Fix for index block appearing as a flushed index block, now explicit …

2df8748

…check has on disk segment (#3622)

Merge remote-tracking branch 'origin/r/index-active-block' into ra/in…

d90f3b1

…dex-active-block-cold-write-batch

Test out only removing indexed blockStarts which lack cold writes

219aa14

Account for blockSize and index blockSize being different

da6404f

Fix considering block as flushed based on result from tick (check blo…

508327c

…ck result not aggregated results) (#3631)

Merge remote-tracking branch 'origin/r/index-active-block' into ra/in…

3d43b7b

…dex-active-block-cold-write-batch

Experiment with marking flush completion post index

25894ae

Base automatically changed from r/index-active-block to master August 26, 2021 23:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Split cold and warm index writes#3618

Split cold and warm index writes#3618
rallen090 wants to merge 64 commits into
masterfrom
ra/index-active-block-cold-write-batch

rallen090 commented Jul 22, 2021 •

edited

Loading

Uh oh!

rallen090 Jul 22, 2021

Uh oh!

robskillington Jul 22, 2021 •

edited

Loading

Uh oh!

rallen090 Jul 22, 2021

Uh oh!

robskillington Jul 22, 2021

Uh oh!

codecov Bot commented Jul 22, 2021 •

edited

Loading

Uh oh!

robskillington Jul 22, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		// Don't pass stats back from insertion into a cold block,
		// we only care about warm mutable segments stats.

Uh oh!

Conversation

rallen090 commented Jul 22, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rallen090 Jul 22, 2021

Choose a reason for hiding this comment

Uh oh!

robskillington Jul 22, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

rallen090 Jul 22, 2021

Choose a reason for hiding this comment

Uh oh!

robskillington Jul 22, 2021

Choose a reason for hiding this comment

Uh oh!

codecov Bot commented Jul 22, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

robskillington Jul 22, 2021

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

rallen090 commented Jul 22, 2021 •

edited

Loading

robskillington Jul 22, 2021 •

edited

Loading

codecov Bot commented Jul 22, 2021 •

edited

Loading