moonfire-nvr

Commit Graph

Author	SHA1	Message	Date
Scott Lamb	d6fa470713	tests and fixes for Writer and Syncer * separate these out into a new file, writer.rs, as dir.rs was getting unwieldy. * extract traits for the parts of SampleFileDir and std::fs::File they needed; set up mock implementations. * move clock.rs to a new base crate to be accessible from the db crate. * add tests that exercise all the retry paths. * bugfix: account for the new recording's bytes when calculating how much to delete. * bugfix: when retrying an unlink failure in collect_garbage, we shouldn't warn about all the recordings no longer existing. Do this by retrying each step rather than the whole procedure again. * avoid double-panic scenarios, which I hit while tweaking the mocks. These are quite annoying to debug as Rust doesn't print information about either panic. I ended up using lldb to get a backtrace. Better to be cautious about what we're doing when already panicking. * give more context on raw::insert_recording errors, which I hit as well while tweaking the new tests.	2018-03-07 04:42:46 -08:00
Scott Lamb	b78ffc3808	view in-progress recordings! The time from recorded to viewable was previously 60-120 sec for the first recording of a RTSP session, 0-60 sec otherwise. Now it's one frame.	2018-03-02 15:40:32 -08:00
Scott Lamb	b2a8b3c216	update "moonfire-nvr check" for new schema	2018-03-01 17:07:42 -08:00
Scott Lamb	b677964d1a	properly account for bytes to add with next flush This was considering them as 0, so it would under-delete until the next flush them delete all at once. That effectively doubled the number of bytes not yet deleted as they're first transferred to garbage, flushed again, then unlinked.	2018-03-01 13:50:59 -08:00
Scott Lamb	0f2e71ec4a	more safety around adding/deleting dirs	2018-03-01 12:24:32 -08:00
Scott Lamb	f01f523c2c	refine 1->3 upgrade process In hindsight, the "post_tx" step in the upgrade process introduced in `e7f5733` doesn't make sense. If the procedure fails at this stage, nothing says it still needs to be completed. If the sample file dirs have to be updated after the database, then there should be another database version to mark that it's fully completed, and indeed that's the purpose version 3 serves. So get rid of the Upgrader trait and just go back to a simple run function per version. In the case of the sample file dir metadata, it actually can happen before the database transaction; the stuff written to the database later just needs to be consistent with what it finds if there's an existing metadata file from a half-completed update. For safety, ensure there are no unexpected directory contents before upgrading 1->2, and ensure the metadata matches before upgrading 2->3.	2018-03-01 09:47:56 -08:00
Scott Lamb	bcf42fe02c	move db upgrade logic into db crate This allows shrinking db's API surface.	2018-02-28 21:21:47 -08:00
Scott Lamb	fb4d88d3e2	make db::dir::Writer equally stubborn Every recording it starts must be sent to the syncer with at least one sample written. It will try forever (unless the channel is down, then panic). This avoids the situation in which it prevents something in the uncommitted VecDeque from ever being synced and thus any further recordings from being flushed.	2018-02-28 12:32:52 -08:00
Scott Lamb	b1d71c4e8d	improve Syncer's robustness The new approach is to, rather than panicking, retry forever. The assumption is that if a given operation is failing, a following operation is unlikely to succeed, so it's simpler to just keep trying the earlier one than come up with ways to undo it and proceed with later operations. I still need to apply this approach to the Writer class. It currently unwraps (crashes) or just gives up on a recording without ever sending it to the Syncer. Given that recordings are all synced in order, that means further ones can never be synced.	2018-02-28 11:07:55 -08:00
Scott Lamb	8d9939603e	fix repeated deletions within a flush When list_oldest_recordings was called twice with no intervening flush, it returned the same rows twice. This led to trying to delete it twice and all following flushes failing with a "no such recording x/y" message. Now, return each row only once, and track how many bytes have been returned. I think dir.rs's logic is still wrong for how many bytes to delete when multiple recordings are flushed at once (it ignores the bytes added by the first when computing the bytes to delete for the second), but this is progress.	2018-02-23 13:49:57 -08:00
Scott Lamb	843e1b49c8	take FnMut closures by reference I mistakenly thought these had to be monomorphized. (The FnOnce still does, until rust-lang/rfcs#1909 is implemented.) Turns out this way works fine. It should result in less compile time / code size, though I didn't check this.	2018-02-23 09:19:42 -08:00
Scott Lamb	bf45ae6011	extend recording_playback with an open_id As noted in schema.sql, this can be used for disambiguation. It also may be useful in diagnosing data integrity problems. Also, sneak in a couple minor improvements: better diagnostics in a couple places, fix to 1->2 upgrade procedure.	2018-02-22 21:46:41 -08:00
Scott Lamb	b037c9bdd7	knob to reduce db commits (SSD write cycles) This improves the practicality of having many streams (including the doubling of streams by having main + sub streams for each camera). With these tuned properly, extra streams don't cause any extra write cycles in normal or error cases. Consider the worst case in which each RTSP session immediately sends a single frame and then fails. Moonfire retries every second, so this would formerly cause one commit per second per stream. (flush_if_sec=0 preserves this behavior.) Now the commits can be arbitrarily infrequent by setting higher values of flush_if_sec. WARNING: this isn't production-ready! I hacked up dir.rs to make tests pass and "moonfire-nvr run" work in the best-case scenario, but it doesn't handle errors gracefully. I've been debating what to do when writing a recording fails. I considered "abandoning" the recording then either reusing or skipping its id. (in the latter case, marking the file as garbage if it can't be unlinked immediately). I think now there's no point in abandoning a recording. If I can't write to that file, there's no reason to believe another will work better. It's better to retry that recording forever, and perhaps put the whole directory into an error state that stops recording until those writes go through. I'm planning to redesign dir.rs to make this happen.	2018-02-22 16:35:34 -08:00
Scott Lamb	31adbc1e9f	initial split of database to a separate crate It should reduce compile time / memory usage to put quite a bit of the code into a separate crate. I also intend to limit visibility of some things to only within the db crate, but that's for a future change. This is the smallest move that will compile.	2018-02-20 23:15:39 -08:00

14 Commits