Fix an occasional DTS overlap by
closing the filtergraph after each
segment and re-creating it at the
beginning of each segment, instead
of attempting to persist the
filtergraph in between segments.
This overlap occurred mostly when
flip-flopping segments between transcoders,
or processing non-consecutive segments within
a single transcoder. This was due to drift in
adjusting input timestamps to match the fps
filter's expectation of mostly consecutive
timestamps while adjusting output timestamps
to remove accumulated delay from the filter.
There is roughly a 1% performance hit on my
machine from re-creating the filtergraph.
Because we are now resetting the filter after
each segment, we can remove a good chunk of
the special-cased timestamp handling code
before and after the filtergraph since
we no longer need to handle discontinuities
between segments.
However, we do need to keep some filter flushing
logic in order to accommodate low-fps or low-frame
content.
This does change our outputs, usually by one
fewer frame. Sometimes we seem to produce an
*additional* frame - it is unclear why. However,
as the test cases note, this actually clears up a
number of long-standing oddities around the expected
frame count, so it should be seen as an improvement.
---
It is important to note that while this fixes DTS
overlap in a (rather unpredictable) general case,
there is another overlap bug in one very specific case.
These are the conditions for the bug:
1. First and second segments of the stream are being
processed. This could be the same transcoder or
different ones.
2. The first segment starts at or near zero pts
3. mpegts is the output format
4. B-frames are being used
What happens is we may see DTS < PTS for the
very first frames in the very first segment,
potentially starting with PTS = 0, DTS < 0.
This is expected for B-frames.
However, if mpegts is in use, it cannot take negative
timestamps. To accommodate negative DTS, the muxer
will set PTS = -DTS, DTS = 0 and delay (offset) the
rest of the packets in the segment accordingly.
Unfortunately, subsequent transcodes will not know
about this delay! This typically leads to an overlap
between the first and second segments (but segments after
that would be fine).
The normal way to fix this would be to add a constant delay
to all segments - ffmpeg adds 1.4s to mpegts by default.
However, introducing a delay right now feels a little
odd since we don't really offer any other knobs to control
the timestamp (re-transcodes would accumulate the delay) and
there is some concern about falling out of sync with the
source segment since we have historically tried to make
timestamps follow the source as closely as possible.
So we're leaving this particular bug as-is for now.
There is some commented-out code that adds this delay
in case we feel that we would need it in the future.
Note that FFmpeg CLI also has the exact same problem
when the muxer delay is removed, so this is not a
LPMS-specific issue. This is exercised in the test cases.
Example of non-monotonic DTS after encoding and after muxing:
Segment.Frame | Encoder DTS | Encoder PTS | Muxer DTS | Muxer PTS
--------------|-------------|-------------|-----------|-----------
1.1 | -20 | 0 | 0 | 20
1.2 | -10 | 10 | 10 | 30
1.3 | 0 | 20 | *20* | 40
1.4 | 10 | 30 | *30* | 50
2.1 | 20 | 40 | *20* | 40
2.2 | 30 | 50 | *30* | 50
2.3 | 40 | 60 | 40 | 60
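The shift and the resulting overlap in the table can be sketched with plain integer arithmetic. This is a simplified model, not the actual mpegts muxer or LPMS code; `avoid_negative_ts` and `struct pkt` are hypothetical names:

```c
/* Simplified model of how the mpegts muxer avoids a negative DTS on the
 * first packet: delay the whole segment so the first DTS becomes 0.
 * Hypothetical sketch, not the actual muxer or LPMS code. */
struct pkt { long long dts, pts; };

void avoid_negative_ts(struct pkt *pkts, int n) {
    if (n == 0 || pkts[0].dts >= 0)
        return;
    long long offset = -pkts[0].dts;   /* delay applied to every packet */
    for (int i = 0; i < n; i++) {
        pkts[i].dts += offset;
        pkts[i].pts += offset;
    }
}
```

Muxing the two segments from the table independently shows the overlap: segment 1 ends at DTS 30 after the shift, while segment 2, which never sees the shift, still starts at DTS 20.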
The demuxer_opts pointer was left uninitialized when inp->demuxer.opts
was NULL. This caused avformat_open_input to receive a garbage pointer,
leading to a crash in av_dict_copy when processing dictionary options.
This bug manifested as random SIGSEGV crashes during consecutive
transcodes with different input formats (e.g., TestAPI_ConsecutiveMP4s).
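The fix is the usual "initialize to NULL, assign conditionally" pattern. A generic sketch with hypothetical names (the real code deals with `AVDictionary **` and `avformat_open_input`):

```c
#include <stddef.h>

/* Hypothetical stand-ins for the real types and calls. */
typedef struct { const char *opts; } demuxer_info;

const char *open_input(const char **opts) {
    /* Like avformat_open_input, a NULL options pointer means "none". */
    return opts ? *opts : "(no options)";
}

const char *setup_demuxer(demuxer_info *inp) {
    const char **demuxer_opts = NULL;  /* the fix: start from NULL */
    if (inp->opts)
        demuxer_opts = &inp->opts;
    /* Before the fix, demuxer_opts was left uninitialized on the
     * NULL branch, so open_input received a garbage pointer. */
    return open_input(demuxer_opts);
}
```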
Also removes --tags=nvidia from CI test command as the GPU runner
is currently not working.
Signed-off-by: livepeer-tessa <livepeer-tessa@users.noreply.github.com>
Co-authored-by: livepeer-tessa <livepeer-tessa@users.noreply.github.com>
Some inputs can trigger the FPS/filter pipeline to generate far more output frames
than are actually decoded, leading to very long, disk-filling transcodes.
Plumb decoded frame counts into the encoder path and, for video outputs, abort with
`lpms_ERR_ENC_RUNAWAY` when encoded frames exceed 25x decoded frames, excluding
`image2` inputs where expansion is expected.
The exact inputs which trigger this behavior are unknown as of now,
but we can construct a contrived test which reproduces the issue.
This was causing some very large segments to be produced if the
input had some weird characteristics like missing timestamps.
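A minimal sketch of the guard; the 25x threshold and the error name come from the description above, while the function shape and `is_image2` flag are illustrative, not the actual LPMS signature:

```c
/* Runaway-encoder guard: abort when encoded frames exceed 25x decoded
 * frames, except for image2 inputs where expansion is expected.
 * Illustrative shape, not the actual LPMS code. */
enum { LPMS_OK = 0, LPMS_ERR_ENC_RUNAWAY = -1 };

int check_enc_runaway(long long decoded, long long encoded, int is_image2) {
    if (is_image2)
        return LPMS_OK;            /* expansion expected for image2 */
    if (decoded > 0 && encoded > 25 * decoded)
        return LPMS_ERR_ENC_RUNAWAY;
    return LPMS_OK;
}
```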
Co-authored-by: Marco van Dijk <marco@stronk.rocks>
* Add a duration check for the input file to avoid transcoding overly long inputs, and to protect against inputs with timestamp anomalies that cause the output to be much longer than the input
---------
Co-authored-by: Josh Allmann <joshua.allmann@gmail.com>
Fixes a number of things including an LPMS crash, choppy video
quality, green screens during rotation, inconsistent frame counts
vs software decoding, etc. We also apparently gained GPU
support for MPEG2 decoding.
This is a massive change: we can no longer add outputs up front
due to the ffmpeg hwaccel API, so we have to wait until we receive
a decoded video frame in order to add outputs. This also means
properly queuing up audio and draining things in the same order.
This adds demuxer options as a complement to the existing encoder/muxer
options which allows us to:
1. explicitly select the demuxer to use if probing doesn't return a good result
2. configure the demuxer with additional options
This has come up a few times while looking at various things, so it is
good to have an API that is fully configurable out of the box.
This allows the transcoded resolution to be re-clamped
correctly if the input resolution changes mid-segment.
As a result, we no longer need to do this clamping in golang.
Additionally, make the behavior between GPU and CPU more consistent
by applying nvidia codec limits and clamping CPU transcodes.
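A sketch of aspect-preserving clamping; the 4096 limit used below is a placeholder for the actual nvidia codec limits, and the rounding rules in LPMS may differ:

```c
/* Clamp a resolution to a maximum dimension while preserving the aspect
 * ratio, then force both dimensions even (required by most encoders for
 * 4:2:0 content). Sketch only, not the actual LPMS code. */
void clamp_res(int *w, int *h, int max_dim) {
    int big = *w > *h ? *w : *h;
    if (big <= max_dim)
        return;
    *w = ((*w * max_dim / big) & ~1);
    *h = ((*h * max_dim / big) & ~1);
}
```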
This usually happens with CUVID if the decoder needs to be reset
internally for whatever reason, such as a mid-stream resolution
change.
Also block demuxing until decoder is ready to receive packets again.
Also add another condition for re-initialization: if the
input resolution changes. This triggers the filter graph
to re-build and adjust to the new resolution, when CPU
encoders are in use.
This mostly ensures that non-B frames have the same dts/pts.
The PTS/DTS from the encoder can be "squashed" a bit during rescaling
back to the source timebase if it is used directly, due to the lower
resolution of the encoder timebase. We avoid this problem with the
PTS in FPS passthrough mode by reusing the source pts, but only
rescale the encoder-provided DTS back to the source timebase for some
semblance of timestamp consistency. Because the DTS values are
squashed, they can differ from the PTS even with non-B frames.
The DTS values are still monotonic, so the exact numbers are not really
important. However, some tools use `dts == pts` as a heuristic to check
for B-frames ... so help them out to avoid spurious B-frame detections.
To fix the DTS/PTS mismatch, take the difference between the
encoder-provided dts/pts, rescale that difference back to the source
time base, and re-calculate the dts using the source pts.
Also see https://github.com/livepeer/lpms/pull/405
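The recalculation can be sketched with simplified integer rescaling; the real code uses ffmpeg's `av_rescale_q`, and timebases here are assumed to be of the form 1/den:

```c
/* Rescale a value between 1/in_den and 1/out_den timebases.
 * Simplified stand-in for av_rescale_q (no rounding control). */
long long rescale_delta(long long v, int in_den, int out_den) {
    return v * out_den / in_den;
}

/* Recompute the output DTS from the source PTS: take the encoder's
 * pts-dts delta, rescale it to the source timebase, and subtract. */
long long recalc_dts(long long src_pts, long long enc_pts, long long enc_dts,
                     int enc_den, int src_den) {
    long long delta = enc_pts - enc_dts;             /* encoder timebase */
    return src_pts - rescale_delta(delta, enc_den, src_den);
}
```

For a non-B frame the encoder delta is 0, so the output DTS equals the source PTS, which is exactly the heuristic those tools rely on.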
* Port install_ffmpeg.sh from go-livepeer
* Update ffmpeg and nv-codec-headers versions.
* Use local install_ffmpeg.sh in github CI
* Update transcoder for ffmpeg 7.0.1
* Update tests to be compatible with ffmpeg7 binary
* Fix FPS passthrough
* Set the encoder timebase using AVCodecContext.framerate instead of
the decoder's AVCodecContext.time_base.
The use of AVCodecContext.time_base is deprecated for decoding.
See https://ffmpeg.org/doxygen/3.3/structAVCodecContext.html#ab7bfeb9fa5840aac090e2b0bd0ef7589
* Adjust the packet timebase as necessary for FPS pass through
to match the encoder's expected timebase. For filtergraphs using
FPS adjustment, the filtergraph output timebase will match the
framerate (1 / framerate) and the encoder is configured for the same.
However, for FPS pass through, the filtergraph's output timebase
will match the input timebase (since there is no FPS adjustment)
while the encoder uses the timebase detected from the decoder's
framerate. Since the input timebase does not typically match the FPS
(eg 90khz for mpegts vs 30fps), we need to adjust the packet timestamps
(in container timebase) to the encoder's expected timebase.
* For the specific case of FPS passthrough, preserve the original PTS
as much as possible since we are trying to re-encode existing frames
one-to-one. Use the opaque field for this, since it is already being
populated with the original PTS to detect sentinel packets
during flushing.
Without this, timestamps can be slightly "squashed" down when
rescaling output packets to the muxer's timebase, due to the loss of
precision (eg, demuxer 90khz -> encoder 30hz -> muxer 90khz)
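The squashing is plain integer truncation in the rescale. A round trip through the coarser timebase, sketched with simplified 1/den rescaling (not ffmpeg's av_rescale_q):

```c
/* Rescale between 1/in_den and 1/out_den timebases; the integer division
 * is where the precision loss ("squashing") happens. Sketch only. */
long long rescale_ts(long long ts, int in_den, int out_den) {
    return ts * out_den / in_den;
}
```

A 90 kHz PTS of 3001 becomes 1 in the 1/30 encoder timebase and comes back as 3000; reusing the original PTS via the opaque field sidesteps the loss.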
* Improve VFR support.
Manually calculate the duration of each frame and set
the PTS to that before submitting to the filtergraph.
This allows us to better support variable frame rates,
and is also better aligned with how ffmpeg does it.
This may change the number of frames output by the FPS
filter by +/- 1 frame. These aren't issues in themselves,
but they break a lot of test cases which will need to be updated.
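The duration calculation can be sketched as follows; the shape is hypothetical, and since the last frame has no successor, its duration is carried over from the previous gap here:

```c
/* Derive per-frame durations for VFR content from successive PTS gaps.
 * Sketch of the approach described above, not the actual LPMS code. */
void calc_durations(const long long *pts, long long *dur, int n) {
    for (int i = 0; i + 1 < n; i++)
        dur[i] = pts[i + 1] - pts[i];
    if (n > 1)
        dur[n - 1] = dur[n - 2];   /* no next frame: reuse previous gap */
    else if (n == 1)
        dur[0] = 1;                /* arbitrary fallback for a lone frame */
}
```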
* Update test cases for VFR.
This commit allows for switching between transcode() function
implementations. Refactored transcode2() will be used when
environment variable LPMS_USE_NEW_TRANSCODE is set, and the
original transcode() otherwise.
The idea is to allow for gradual roll-out of new refactored
implementation.
Due to my omission the previous solution was going against the
grain of upcoming refactoring changes in lpms_transcode().
Basically the idea is to have streamlined transcoder init() code
so that changes such as Low Latency can be implemented easily in
one place, instead of being distributed in many flows.
Besides, adding yet another "mode" to lpms_transcode() is not
really needed. The transcoder is kinda like any other object
instance - it retains its status between the lpms_transcode()
calls. So I think it is easier to add extra operations (such
as lpms_transcode_reopen_demux() introduced here) to change said
state, instead of increasing the complexity of the lpms_transcode()
call.
And finally, perhaps the most important thing:
Changes like these are needed because LPMS doesn't really have
good handling of changing configuration (in this case we are
talking about changing from container stream without audio to
the one with audio). It kind of pretends to do so, and will
handle certain small differences kinda OK, but it is very easy
to come up with a situation that will break it completely.
Ideally, I'd like to change that by reducing transient state
to the minimum (such as certain number of hardware buffers for
decoding and encoding) and really re-initialize everything else
that can be reinitialized cheaply. This, however, requires
low-level hardware codec programming; it is not possible to do
that at the ffmpeg level.