moonfire-nvr

mirror of https://github.com/scottlamb/moonfire-nvr.git synced 2024-12-27 15:45:55 -05:00

Author	SHA1	Message	Date
Scott Lamb	7fe9d34655	cargo fix --all * it added "dyn" to trait objects * it changed "..." in patterns to "..=" cargo --version says: "cargo 1.37.0-nightly (545f35425 2019-05-23)"	2019-06-14 08:47:11 -07:00
Scott Lamb	c271cfa2b5	make Writer enforce maximum recording duration My installation recently somehow ended up with a recording with a duration of 503793844 90,000ths of a second, way over the maximum of 5 minutes. (Looks like the machine was pretty unresponsive at the time and/or having network problems.) When this happens, the system really spirals. Every flush afterward (12 per minute with my installation) fails with a CHECK constraint failure on the recording table. It never gives up on that recording. /var/log fills pretty quickly as this failure is extremely verbose (a stack trace, and a line for each byte of video_index). Eventually the sample file dirs fill up too as it continues writing video samples while GC is stuck. The video samples are useless anyway; given that they're not referenced in the database, they'll be deleted on next startup. This ensures the offending recording is never added to the database, so we don't get the same persistent problem. Instead, writing to the recording will fail. The stream will drop and be retried. If the underlying condition that caused a too-long recording (many non-key-frames, or the camera returning a crazy duration, or the monotonic clock jumping forward extremely, or something) has gone away, the system should recover.	2019-01-29 08:26:36 -08:00
Scott Lamb	3ba3bf2b18	backend support for live stream (#59 ) This is so far completely untested, for use by a new UI prototype. It creates a new URL endpoint which sends one video/mp4 media segment per key frame, with the dependent frames included. This means there will be about one key frame interval of latency (typically about a second). This seems hard to avoid, as mentioned in issue #59.	2019-01-21 15:58:52 -08:00
Scott Lamb	b9e6a6461f	fix streamer tests broken by `4cc796f` That commit rewrote the db/writer.rs tests. But the flush op they used before was also used by src/streamer.rs tests. Reintroduce it.	2019-01-06 07:07:04 -08:00
Scott Lamb	4cc796f697	properly test fix for #64 I went with the third idea in `1ce52e3`: have the tests run each iteration of the syncer explicitly. These are messy tests that know tons of internal details, but I think they're less confusing and racy than if I had the syncer running in a separate thread.	2019-01-04 16:11:58 -08:00
Scott Lamb	1ce52e334c	fix #64 (extraneous flushes) Now each syncer has a binary heap of the times it plans to do a flush. When one of those times arrives, it rechecks if there's something to do. Seems more straightforward than rechecking each stream's first uncommitted recording, especially with the logic to retry failed flushes every minute. Also improved the info! log for each flush to see the actual recordings being flushed for better debuggability. No new tests right now. :-( They're tricky to write. One problem is that it's hard to get the timing right: a different flush has to happen after Syncer::save's database operations and before Syncer::run calls SimulatedClocks::recv_timeout with an empty channel[], advancing the time. I've thought of a few ways of doing this: adding a new SyncerCommand to run something, but it's messy (have to add it from the mock of one of the actions done by the save), and Box<dyn FnOnce() + 'static> not working (see rust-lang/rust#28796) makes it especially annoying. * replacing SimulatedClocks with something more like MockClocks. Lots of boilerplate. Maybe I need to find a good general-purpose Rust mock library. (mockers sounds good but I want something that works on stable Rust.) * bypassing the Syncer::run loop, instead manually running iterations from the test. Maybe the last way is the best for now. I'm likely to try it soon. [*] actually, it's calling Receiver::recv_timeout directly; Clocks::recv_timeout is dead code now? oops.	2019-01-04 13:47:44 -08:00
Scott Lamb	55fa458288	fix confusing variable name + comment	2019-01-04 06:16:25 -08:00
Scott Lamb	b5387af3d4	lose "extern crate" everywhere (Rust 2018 edition)	2018-12-28 21:59:39 -06:00
Scott Lamb	699ec87968	upgrade to 2018 Rust edition This is mostly just "cargo fix --edition" + Cargo.toml changes. There's one fix for upgrading to NLL in db/writer.rs: Writer::previously_opened wouldn't build with NLL because of a double-borrow the previous borrow checker somehow didn't catch. Restructure to avoid it. I'll put elective NLL changes in a following commit.	2018-12-28 14:59:06 -06:00
Scott Lamb	131c5e0640	Fix "no garbage row for <id>" flush failure loops Add some comments along the way. Fixes #63.	2018-12-01 00:03:43 -08:00
Scott Lamb	d7a94956eb	deflake writer tests There was a race condition here because it wasn't waiting for the db flush to complete. This made write_path_retries sometimes not reflect the consequence of the flush, causing an assertion failure. I assume it was also responsible for gc_path_retries timeouts under travis-ci.	2018-08-07 21:58:40 -05:00
Scott Lamb	91636d3193	refine flush_if_sec behavior The new behavior eliminates a couple unpleasant edge cases in which it would never flush: * if all recording stops, whatever was unflushed would stay that way * if every recording attempt produces a 0-duration recording (such as if the camera sends only one frame and thus no PTS delta can be calculated), the list of recordings to flush would continue to grow	2018-03-23 15:16:43 -07:00
Scott Lamb	addeb9d2f6	add a TimerGuard around db locks & ops I moved the clocks member from LockedDatabase to Database to make this happen, so the new DatabaseGuard (replacing a direct MutexGuard<LockedDatabase>) can access it before acquiring the lock. I also made the type of clock a type parameter of Database (and so several other things throughout the system). This allowed me to drop the Arc<>, but more importantly it means that the Clocks trait doesn't need to stay object-safe. I plan to take advantage of that shortly.	2018-03-23 13:31:23 -07:00
Scott Lamb	4c8daa6d24	save timestamps along with opens	2018-03-10 16:15:36 -08:00
Scott Lamb	03809eee8e	clean up leftover throwaway logging	2018-03-09 10:46:34 -08:00
Scott Lamb	d6fa470713	tests and fixes for Writer and Syncer * separate these out into a new file, writer.rs, as dir.rs was getting unwieldy. * extract traits for the parts of SampleFileDir and std::fs::File they needed; set up mock implementations. * move clock.rs to a new base crate to be accessible from the db crate. * add tests that exercise all the retry paths. * bugfix: account for the new recording's bytes when calculating how much to delete. * bugfix: when retrying an unlink failure in collect_garbage, we shouldn't warn about all the recordings no longer existing. Do this by retrying each step rather than the whole procedure again. * avoid double-panic scenarios, which I hit while tweaking the mocks. These are quite annoying to debug as Rust doesn't print information about either panic. I ended up using lldb to get a backtrace. Better to be cautious about what we're doing when already panicking. * give more context on raw::insert_recording errors, which I hit as well while tweaking the new tests.	2018-03-07 04:42:46 -08:00

16 Commits