moonfire-nvr

mirror of https://github.com/scottlamb/moonfire-nvr.git synced 2025-01-13 07:53:22 -05:00

Author	SHA1	Message	Date
Scott Lamb	c0da1ef880	make v1->v3 upgrade work with --features=bundled --features=bundled enables -DSQLITE_DEFAULT_FOREIGN_KEYS=1, and so some operations have to be done in the proper order. * enable foreign key enforcement all the time, so I test this more reliably. * reorder some parts of the v1->v3 order. foreign key enforcement is immediate (rather than deferred) by default. and ensure old_recording_playback isn't left with a dangling reference to old_recording at the v2 stage. Instead, wait until v3 to delete tables it depends on.	2018-03-22 09:05:40 -07:00
Scott Lamb	c46c50af8f	fix another upgrade error in `dfee66c`	2018-03-22 00:08:49 -07:00
Scott Lamb	2ff7ecb6f4	fix upgrade procedure broken in `dfee66c`	2018-03-22 00:00:39 -07:00
Scott Lamb	1c9f2a4d83	initial schema for user authentication (#26 ) This is only the database schema, which I'm adding now in the hopes of freezing schema version 3. There's no way yet to create users, much less actually authenticate.	2018-03-21 23:57:45 -07:00
Scott Lamb	dfee66c84b	support additional recording_integrity timestamps These are not actually populated by the code yet. I'm trying to get the v3 schema frozen as soon as possible; actually using the fields can come later. Add some explanation of their value in time.md, along with some general musing on leap seconds, and a correction on the frequency error of my cameras.	2018-03-21 22:32:41 -07:00
Scott Lamb	88051a1188	adjust startup timings again I forgot to drop the cache before grabbing the numbers earlier today.	2018-03-20 22:37:45 -07:00
Scott Lamb	bdf52d743b	adjust some timings in schema.md The new numbers are taken from my odroid setup. In particular, the size check is noticeably slower than what I'd gathered before, enough to show that it shouldn't be performed on startup.	2018-03-20 08:46:48 -07:00
Scott Lamb	4c8daa6d24	save timestamps along with opens	2018-03-10 16:15:36 -08:00
Scott Lamb	f81d699c8c	new recording_integrity table A couple rarely-used fields move to here, and I expect I'll add more. Redo the check command to just put everything in RAM for simplicity.	2018-03-09 13:37:30 -08:00
Scott Lamb	03809eee8e	clean up leftover throwaway logging	2018-03-09 10:46:34 -08:00
Scott Lamb	d6fa470713	tests and fixes for Writer and Syncer * separate these out into a new file, writer.rs, as dir.rs was getting unwieldy. * extract traits for the parts of SampleFileDir and std::fs::File they needed; set up mock implementations. * move clock.rs to a new base crate to be accessible from the db crate. * add tests that exercise all the retry paths. * bugfix: account for the new recording's bytes when calculating how much to delete. * bugfix: when retrying an unlink failure in collect_garbage, we shouldn't warn about all the recordings no longer existing. Do this by retrying each step rather than the whole procedure again. * avoid double-panic scenarios, which I hit while tweaking the mocks. These are quite annoying to debug as Rust doesn't print information about either panic. I ended up using lldb to get a backtrace. Better to be cautious about what we're doing when already panicking. * give more context on raw::insert_recording errors, which I hit as well while tweaking the new tests.	2018-03-07 04:42:46 -08:00
Scott Lamb	b78ffc3808	view in-progress recordings! The time from recorded to viewable was previously 60-120 sec for the first recording of a RTSP session, 0-60 sec otherwise. Now it's one frame.	2018-03-02 15:40:32 -08:00
Scott Lamb	45f7b30619	allow listing and viewing uncommitted recordings There may be considerable lag between being fully written and being committed when using the flush_if_sec feature. Additionally, this is a step toward listing and viewing recordings before they're fully written. That's a considerable delay: 60 to 120 seconds for the first recording of a run, 0 to 60 seconds for subsequent recordings. These recordings aren't yet included in the information returned by /api/?days=true. They probably should be, but small steps.	2018-03-02 11:38:11 -08:00
Scott Lamb	b17761e871	move list_recordings_by_* logic into raw.rs I want to start having the db.rs version augment this with the uncommitted recordings, and it's nice to have the separation of the raw db vs augmented versions. Also, this fits with the general theme of shrinking db.rs a bit. I had to put the raw video_sample_entry_id into the rows rather than the video_sample_entry Arc. In hindsight, this is better anyway: the common callers don't need to do the btree lookup and arc clone on every row. I think I'd originally done it that way only because I was quite new to rust and didn't understand that db could be used from within the row callback given that both borrows are immutable.	2018-03-01 20:59:05 -08:00
Scott Lamb	b2a8b3c216	update "moonfire-nvr check" for new schema	2018-03-01 17:07:42 -08:00
Scott Lamb	b677964d1a	properly account for bytes to add with next flush This was considering them as 0, so it would under-delete until the next flush them delete all at once. That effectively doubled the number of bytes not yet deleted as they're first transferred to garbage, flushed again, then unlinked.	2018-03-01 13:50:59 -08:00
Scott Lamb	0f2e71ec4a	more safety around adding/deleting dirs	2018-03-01 12:24:32 -08:00
Scott Lamb	f01f523c2c	refine 1->3 upgrade process In hindsight, the "post_tx" step in the upgrade process introduced in `e7f5733` doesn't make sense. If the procedure fails at this stage, nothing says it still needs to be completed. If the sample file dirs have to be updated after the database, then there should be another database version to mark that it's fully completed, and indeed that's the purpose version 3 serves. So get rid of the Upgrader trait and just go back to a simple run function per version. In the case of the sample file dir metadata, it actually can happen before the database transaction; the stuff written to the database later just needs to be consistent with what it finds if there's an existing metadata file from a half-completed update. For safety, ensure there are no unexpected directory contents before upgrading 1->2, and ensure the metadata matches before upgrading 2->3.	2018-03-01 09:47:56 -08:00
Scott Lamb	bcf42fe02c	move db upgrade logic into db crate This allows shrinking db's API surface.	2018-02-28 21:21:47 -08:00
Scott Lamb	fbe1231af0	move open_id from recording_playback to recording I want to be able to use it in etags without having to do a full scan of the recording_playback in advance, which would greatly increase time to first byte. I probably will even use it in urls to ensure the segments they point to are stable. I haven't actually done this yet - it will wait until I implement serving unflushed recordings - but I want to get the schema set up properly.	2018-02-28 20:52:43 -08:00
Scott Lamb	fb4d88d3e2	make db::dir::Writer equally stubborn Every recording it starts must be sent to the syncer with at least one sample written. It will try forever (unless the channel is down, then panic). This avoids the situation in which it prevents something in the uncommitted VecDeque from ever being synced and thus any further recordings from being flushed.	2018-02-28 12:32:52 -08:00
Scott Lamb	b1d71c4e8d	improve Syncer's robustness The new approach is to, rather than panicking, retry forever. The assumption is that if a given operation is failing, a following operation is unlikely to succeed, so it's simpler to just keep trying the earlier one than come up with ways to undo it and proceed with later operations. I still need to apply this approach to the Writer class. It currently unwraps (crashes) or just gives up on a recording without ever sending it to the Syncer. Given that recordings are all synced in order, that means further ones can never be synced.	2018-02-28 11:07:55 -08:00
Scott Lamb	b790075ca2	fix flush_if_sec updates not hitting db	2018-02-23 14:49:10 -08:00
Scott Lamb	81e1ec97b3	more sanity checking for delete_oldest_recordings	2018-02-23 14:05:07 -08:00
Scott Lamb	8d9939603e	fix repeated deletions within a flush When list_oldest_recordings was called twice with no intervening flush, it returned the same rows twice. This led to trying to delete it twice and all following flushes failing with a "no such recording x/y" message. Now, return each row only once, and track how many bytes have been returned. I think dir.rs's logic is still wrong for how many bytes to delete when multiple recordings are flushed at once (it ignores the bytes added by the first when computing the bytes to delete for the second), but this is progress.	2018-02-23 13:49:57 -08:00
Scott Lamb	843e1b49c8	take FnMut closures by reference I mistakenly thought these had to be monomorphized. (The FnOnce still does, until rust-lang/rfcs#1909 is implemented.) Turns out this way works fine. It should result in less compile time / code size, though I didn't check this.	2018-02-23 09:19:42 -08:00
Scott Lamb	d9841fd634	fix compilation error in benchmark This needs a separate run of "cargo +nightly bench --features=nightly", so I missed it in a couple previous commits. I probably should set up travis-ci...	2018-02-22 21:57:43 -08:00
Scott Lamb	bf45ae6011	extend recording_playback with an open_id As noted in schema.sql, this can be used for disambiguation. It also may be useful in diagnosing data integrity problems. Also, sneak in a couple minor improvements: better diagnostics in a couple places, fix to 1->2 upgrade procedure.	2018-02-22 21:46:41 -08:00
Scott Lamb	b037c9bdd7	knob to reduce db commits (SSD write cycles) This improves the practicality of having many streams (including the doubling of streams by having main + sub streams for each camera). With these tuned properly, extra streams don't cause any extra write cycles in normal or error cases. Consider the worst case in which each RTSP session immediately sends a single frame and then fails. Moonfire retries every second, so this would formerly cause one commit per second per stream. (flush_if_sec=0 preserves this behavior.) Now the commits can be arbitrarily infrequent by setting higher values of flush_if_sec. WARNING: this isn't production-ready! I hacked up dir.rs to make tests pass and "moonfire-nvr run" work in the best-case scenario, but it doesn't handle errors gracefully. I've been debating what to do when writing a recording fails. I considered "abandoning" the recording then either reusing or skipping its id. (in the latter case, marking the file as garbage if it can't be unlinked immediately). I think now there's no point in abandoning a recording. If I can't write to that file, there's no reason to believe another will work better. It's better to retry that recording forever, and perhaps put the whole directory into an error state that stops recording until those writes go through. I'm planning to redesign dir.rs to make this happen.	2018-02-22 16:35:34 -08:00
Scott Lamb	31adbc1e9f	initial split of database to a separate crate It should reduce compile time / memory usage to put quite a bit of the code into a separate crate. I also intend to limit visibility of some things to only within the db crate, but that's for a future change. This is the smallest move that will compile.	2018-02-20 23:15:39 -08:00
Scott Lamb	d84e754b2a	replace homegrown Error with failure crate This reduces boilerplate, making it a bit easier for me to split the db stuff out into its own crate.	2018-02-20 22:46:14 -08:00
Scott Lamb	253f3de399	reorganize the sample file directory The filenames now represent composite ids (stream id + recording id) rather than a separate uuid system with its own reservation for a few benefits: * This provides more information when there are inconsistencies. * This avoids the need for managing the reservations during recording. I expect this to simplify delaying flushing of newly written sample files. Now the directory has to be scanned at startup for files that never got written to the database, but that's acceptably fast even with millions of files. * Less information to keep in memory and in the recording_playback table. I'd considered using one directory per stream, which might help if the filesystem has trouble coping with huge directories. But that would mean each dir has to be fsync()ed separately (more latency and/or more multithreading). So I'll stick with this until I see concrete evidence of a problem that would solve. Test coverage of the error conditions is poor. I plan to do some restructuring of the db/dir code, hopefully making steps toward testability along the way.	2018-02-20 10:11:10 -08:00
Scott Lamb	e7f5733f29	new database/sample file dir interlock scheme The idea is to avoid the problems described in src/schema.proto; those possibilities have bothered me for a while. A bonus is that (in a future commit) it can replace the sample file uuid scheme in favor of using <camera_uuid>-<stream_type>/<recording_id> for several advantages: * on data integrity problems (specifically, extra sample files), more information to use to understand what happened. * no more reserving sample files prior to using them. This avoids some extra database transactions on startup (now there's an extra two total rather than an extra one per stream). It also simplifies an upcoming change I want to make in which some streams are not flushed immediately, reducing the write load significantly (maybe one per minute total rather than one per stream per minute). * get rid of eight bytes per playback cache entry in RAM (and nine bytes per recording_playback row on flash). The implementation is still pretty rough in places: * Lack of tests. * Poor ode organization. In particular, SampleFileDirectory::write_meta shouldn't be exposed beyond db. I'm thinking about moving db.rs and SampleFileDirectory to a new crate, moonfire_nvr_db. This would improve compile times as well. * No tooling for renaming a sample file directory. * Config subcommand still panics in conditions that can be reasonably expected to happen.	2018-02-14 23:35:52 -08:00
Scott Lamb	89b6bccaa3	support multiple sample file directories This is still pretty basic support. There's no config UI support for renaming/moving the sample file directories after they are created, and no error checking that the files are still in the expected place. I can imagine sysadmins getting into trouble trying to change things. I hope to address at least some of that in a follow-up change to introduce a versioning/locking scheme that ensures databases and sample file dirs match in some way. A bonus change that kinda got pulled along for the ride: a dialog pops up in the config UI while a stream is being tested. The experience was pretty bad before; there was no indication the button worked at all until it was done, sometimes many seconds later.	2018-02-11 23:04:02 -08:00
Scott Lamb	6f309e432f	store rfc6381_codec in the database This avoids having codec-specific logic to synthesize it in db.rs. It's not too much of a problem now with only H.264 support, but it'd be a pain when supporting H.265 and other codecs.	2018-02-05 11:57:59 -08:00
Scott Lamb	cc6579b211	upgrader for v1->v2	2018-02-03 22:19:02 -08:00
Scott Lamb	57c44b5e35	schema 2: add a "record" bool to streams	2018-02-03 22:19:02 -08:00
Scott Lamb	dc402bdc01	schema version 2: support sub streams This allows each camera to have a main and a sub stream. Previously there was a field in the schema for the sub stream's url, but it didn't do anything. Now you can configure individual retention for main and sub streams. They show up grouped in the UI. No support for upgrading from schema version 1 yet.	2018-02-03 22:15:54 -08:00
Scott Lamb	0d69f4f49b	use add_camera in tests, not direct db inserts This is a wash in terms of lines of code now, but it makes it a bit easier to maintain as I make changes to the schema (such as separating out streams from cameras), and it helps ensure the tests reflect reality.	2018-02-03 21:56:04 -08:00
Scott Lamb	c43fb80639	warn if a streamer op takes too long My odroid setup has been occasionally (about once a week) losing about 15 seconds of recordings on all cameras. I'm not sure why. So I'm labelling all the likely suspect spots and logging if any of them takes longer than a second. I think this will give me more information; hopefully narrow it down to network or local disk I/O.	2018-01-31 14:20:30 -08:00
Scott Lamb	6902be1981	upgrade deps	2018-01-30 22:05:39 -08:00
Scott Lamb	d192d98129	more fixes to the upgrade guide	2018-01-30 21:14:53 -08:00
Scott Lamb	9a8ed69047	fix more formatting in upgrade guide	2018-01-30 17:01:44 -08:00
Scott Lamb	7566b6a38b	fix several formatting problems in upgrade guide	2018-01-30 17:01:03 -08:00
Scott Lamb	529aad9982	upgrade deps, including http-serve bugfix This was including the wrong Content-Encoding: header on gzipped json responses, so Chrome wouldn't understand them at all.	2018-01-30 16:51:32 -08:00
Scott Lamb	2c62d977b0	gzip json responses, handle HEAD properly	2018-01-23 11:24:40 -08:00
Scott Lamb	8caa2e5d0e	crate rename: http-(entity\|file) -> http-serve	2018-01-23 11:08:21 -08:00
Scott Lamb	10550bc35f	simplify ffmpeg wrapper crate This was using PhantomData to enforce lifetimes + raw pointers. Simpler to convert to a reference. This also enforces non-null.	2017-11-30 14:40:31 -08:00
Scott Lamb	5c8970fe8a	update dependencies	2017-11-16 23:01:09 -08:00
Scott Lamb	16ed7f73ba	remove redundant panic	2017-11-16 22:54:44 -08:00

1 2 3 4 5

246 Commits