FFmpeg

mirror of https://git.ffmpeg.org/ffmpeg.git synced 2024-10-19 13:03:26 +00:00

Author	SHA1	Message	Date
Anton Khirnov	5939c8d361	lavu/fifo: disallow overly large fifo sizes The API currently allows creating FIFOs up to - UINT_MAX: av_fifo_alloc(), av_fifo_realloc(), av_fifo_grow() - SIZE_MAX: av_fifo_alloc_array() However the usable limit is determined by - rndx/wndx being uint32_t - av_fifo_[size,space] returning int so no FIFO should be larger than the smallest of - INT_MAX - UINT32_MAX - SIZE_MAX (which should be INT_MAX an all commonly used platforms). Return an error on trying to allocate FIFOs larger than this limit.	2022-02-07 00:29:05 +01:00
Andreas Rheinhardt	2d71f93c7c	avutil/fifo: Use av_fifo_generic_peek_at() for av_fifo_generic_peek() Avoids code duplication. It furthermore properly checks for buf_size to be > 0 before doing anything. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-02-07 00:29:05 +01:00
Lynne	2e82c61055	x86/tx_float: avoid redefining macros FFT16_FN was used for fft8 and for fft16 afterwards.	2022-02-02 07:51:45 +01:00
Zhao Zhili	b5a8b3d45a	hwcontext_vulkan: use VkPhysicalDeviceTimelineSemaphoreFeatures VkPhysicalDeviceVulkan12Features isn't implemented on MoltenVK yet. VkPhysicalDeviceTimelineSemaphoreFeatures is less versatile but simple. None of device_features_1_1 nor device_features_1_2 has real usage yet, keep the code for future.	2022-02-01 22:54:24 +01:00
Andreas Rheinhardt	98cef1ebbe	avutil/tests/adler32: Remove unnecessary volatile And use an ordinary stack variable. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-01-29 21:26:26 +01:00
Timo Rothenpieler	2f323b1978	avutil/hwcontext_qsv: fix typo	2022-01-29 15:37:38 +01:00
nyanmisaka	4cc7239d8b	libavutil/hwcontext_opencl: fix a bug for mapping qsv frame to opencl mfxHDLPair was added to qsv, so modify qsv->opencl map function as well. Now the following commandline works: ffmpeg -v verbose -init_hw_device vaapi=va:/dev/dri/renderD128 \ -init_hw_device qsv=qs@va -init_hw_device opencl=ocl@va -filter_hw_device ocl \ -hwaccel qsv -hwaccel_output_format qsv -hwaccel_device qs -c:v h264_qsv \ -i input.264 -vf "hwmap=derive_device=opencl,format=opencl,avgblur_opencl, \ hwmap=derive_device=qsv:reverse=1:extra_hw_frames=32,format=qsv" \ -c:v h264_qsv output.264 Signed-off-by: nyanmisaka <nst799610810@gmail.com> Signed-off-by: Wenbin Chen <wenbin.chen@intel.com>	2022-01-29 12:02:52 +08:00
Lynne	35080149ef	x86/tx_float: mark AVX2 functions as AVXSLOW Makes Bulldozer prefer AVX functions rather than AVX2, which are 64% slower: AVX: 117653 decicycles in av_tx (fft), 1048535 runs, 41 skips AVX2: 193385 decicycles in av_tx (fft), 1048561 runs, 15 skips The only difference between both is that vgatherdpd is used in the former. We don't want to mark them with the new SLOW_GATHER flag however, since gathers are still faster on Haswell/Zen 2/3 than plain loads.	2022-01-29 03:08:16 +01:00
Lynne	7e35e0224c	lavu/tx: do not unconditionally free subcontexts if initialization fails If a codelet initializes 2 subtransforms, and the second one fails, the failure would free all subcontexts. Instead, if there are subcontexts still left, don't free the array. If all initializations fail, the init() function will return, and reset_ctx() from the previous step will clean up all contained subtransforms.	2022-01-29 01:02:37 +01:00
Lynne	265731f201	lavu/tx: reset subcontext pointer if initialization fails Thanks to mkver for pointing this out.	2022-01-29 00:53:35 +01:00
Lynne	95f02e43e1	lavu/tx: print debug info even if no transforms are found	2022-01-28 08:28:02 +01:00
Steven Liu	9887ec3e9b	avutil/tx: add null pointer check after av_mallocz Fix CID: 1497863 there will get null pointer in attempt to initialize each if alloc memory failed. Signed-off-by: Steven Liu <liuqi05@kuaishou.com>	2022-01-28 08:27:48 +01:00
Steven Liu	f0044d886f	avutil/tx: remove deadcode of the control flow Fix CID: 1497864 The control flow should return ENOSYS if nb_cd_matches is 0 at before and the ret equal AVERROR(ENOMEM) or goto end label, so remove the last control flow if (ret >= 0) before end label. Signed-off-by: Steven Liu <liuqi05@kuaishou.com>	2022-01-28 08:27:46 +01:00
Lynne	3c831847a8	hwcontext_vulkan: avoid using 64-bit enums MSVC (2016, but possibly more) still force enums to be basic ints.	2022-01-27 10:27:09 +01:00
Lynne	238e11b71f	lavu/tx: avoid using 64-bit enums MSVC (2016, but possibly more) still force enums to be basic ints.	2022-01-27 10:21:25 +01:00
Lynne	6c397f6bb5	x86/tx_float: add missing FF_TX_OUT_OF_PLACE flag to functions This caused smaller length dedicated transforms to not be picked up.	2022-01-27 02:18:35 +01:00
Lynne	008c131d68	lavu/tx: clean up CPU flags check Just makes it more readable.	2022-01-27 02:18:06 +01:00
Lynne	9787005846	x86/tx_float: do not build tx_float_init.c if x86 assembly is disabled This broke builds with --disable-mmx, which also disabled assembly entirely, but ARCH_X86 was still true, so the init file tried to find assembly that didn't exist. Instead of checking for architecture, check if external x86 assembly is enabled.	2022-01-27 02:17:46 +01:00
Lynne	6c8e841824	lavu/tx: do not mix declarations and code	2022-01-26 04:55:23 +01:00
Lynne	28bff6ae54	x86/tx_float: add permute-free FFT versions These are used in the PFA transforms and MDCTs.	2022-01-26 04:13:58 +01:00
Lynne	350142560b	lavu: bump minor and add APIchanges for new lavu/tx additions	2022-01-26 04:13:57 +01:00
Lynne	af94ab7c7c	lavu/tx: add an RDFT implementation RDFTs are full of conventions that vary between implementations. What I've gone for here is what's most common between both fftw, avcodec's rdft and what we use, the equivalent of which is DFT_R2C for forward and IDFT_C2R for inverse. The other 2 conventions (IDFT_R2C and DFT_C2R) were not used at all in our code, and their names are also not appropriate. If there's a use for either, we can easily add a flag which would just flip the sign on one exptab. For some unknown reason, possibly to allow reusing FFT's exp tables, av_rdft's C2R output is 0.5x lower than what it should be to ensure a proper back-and-forth conversion. This code outputs its real samples at the correct level, which matches FFTW's level, and allows the user to change the level and insert arbitrary multiplies for free by setting the scale option.	2022-01-26 04:12:46 +01:00
Lynne	ef4bd81615	lavu/tx: rewrite internal code as a tree-based codelet constructor This commit rewrites the internal transform code into a constructor that stitches transforms (codelets). This allows for transforms to reuse arbitrary parts of other transforms, and allows transforms to be stacked onto one another (such as a full iMDCT using a half-iMDCT which in turn uses an FFT). It also permits for each step to be individually replaced by assembly or a custom implementation (such as an ASIC).	2022-01-26 04:12:44 +01:00
Lynne	c14976be04	lavu/tx: improve documentation for existing transforms	2022-01-26 04:12:37 +01:00
Diederick Niehorster	7247a6fed8	avutil/pixfmt.h: typo Signed-off-by: Diederick Niehorster <dcnieho@gmail.com> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-01-21 11:58:23 +01:00
Anton Khirnov	137c808f1a	lavu/hwcontext_vulkan: clear dangling pointers on map failure	2022-01-21 09:44:01 +01:00
Limin Wang	8b9ef5a516	avutil/parseutils: use quadhd for Quad HD qHD is 960x540 (q stands for quarter) and QHD is 2560x1440 (Q is quad). use quadhd for QHD for abbreviation. Fix ticket#9591 Signed-off-by: Limin Wang <lance.lmwang@gmail.com>	2022-01-12 13:42:26 +08:00
Anton Khirnov	f480c43dfa	lavu/fifo: return errors on trying to read/write too much Trying to write too much will currently overwrite previous data. Trying to read too much will either av_assert2() in av_fifo_drain() or return old data. Trying to peek too much will either av_assert2() in av_fifo_generic_peek_at() or return old data. Return an error code in all these cases, which is safer and more consistent.	2022-01-10 16:11:34 +01:00
Anton Khirnov	53f513c60b	lavu/fifo: drop useless comments This object was never intended to be thread-safe, so these carry no useful information.	2022-01-10 16:11:18 +01:00
Anton Khirnov	549ccea54e	lavu/fifo: do not copy the whole fifo when reallocating av_realloc() the buffer and only move the part of the ring buffer that needs it. Also avoids allocating a temporary fifo.	2022-01-10 16:05:57 +01:00
Anton Khirnov	5010c481d1	lavu/fifo: simplify av_fifo_alloc() Turn it into a wrapper around av_fifo_alloc_array().	2022-01-10 16:05:20 +01:00
Anton Khirnov	63b013aa68	lavu/fifo: deprecate av_fifo_peek2() It returns a pointer inside the fifo's buffer, which cannot be safely used without accessing AVFifoBuffer internals. It is easier and safer to use av_fifo_generic_peek_at().	2022-01-10 16:04:19 +01:00
Cameron Gutman	242ed971cb	lavu/videotoolbox: add support for memory mapping frames Signed-off-by: Cameron Gutman <aicommander@gmail.com> Signed-off-by: Aman Karmani <aman@tmm1.net>	2022-01-06 19:17:42 -08:00
Wu Jianhua	c4ecc643bb	avutil/hwcontext_vulkan: fixed incorrect memory offset This commit fixed hwupload in Vulkan: ffmpeg -init_hw_device vulkan -i test.jpg -vf hwupload,hwdownload,format=yuv420p -y out.jpg Signed-off-by: Wu Jianhua <jianhua.wu@intel.com>	2022-01-05 14:13:39 +01:00
Haihao Xiang	7c6f9b9d63	Revert "avutils/hwcontext: When deriving a hwdevice, search for existing device in both directions" This reverts commit `a428949775`. There were objections on ML (see https://ffmpeg.org/pipermail/ffmpeg-devel/2021-December/290530.html) Signed-off-by: Haihao Xiang <haihao.xiang@intel.com>	2022-01-05 11:56:58 +08:00
Soft Works	a428949775	avutils/hwcontext: When deriving a hwdevice, search for existing device in both directions The test /libavutil/tests/hwdevice checks that when deriving a device from a source device and then deriving back to the type of the source device, the result is matching the original source device, i.e. the derivation mechanism doesn't create a new device in this case. Previously, this test was usually passed, but only due to two different kind of flaws: 1. The test covers only a single level of derivation (and back) It derives device Y from device X and then Y back to the type of X and checks whether the result matches X. What it doesn't check for, are longer chains of derivation like: CUDA1 > OpenCL2 > CUDA3 and then back to OpenCL4 In that case, the second derivation returns the first device (CUDA3 == CUDA1), but when deriving OpenCL4, hwcontext.c was creating a new OpenCL4 context instead of returning OpenCL2, because there was no link from CUDA1 to OpenCL2 (only backwards from OpenCL2 to CUDA1) If the test would check for two levels of derivation, it would have failed. This patch fixes those (yet untested) cases by introducing forward references (derived_device) in addition to the existing back references (source_device). 2. hwcontext_qsv didn't properly set the source_device In case of QSV, hwcontext_qsv creates a source context internally (vaapi, dxva2 or d3d11va) without calling av_hwdevice_ctx_create_derived and without setting source_device. This way, the hwcontext test ran successful, but what practically happened, was that - for example - deriving vaapi from qsv didn't return the original underlying vaapi device and a new one was created instead: Exactly what the test is intended to detect and prevent. It just couldn't do so, because the original device was hidden (= not set as the source_device of the QSV device). This patch properly makes these setting and fixes all derivation scenarios. (at a later stage, /libavutil/tests/hwdevice should be extended to check longer derivation chains as well) Reviewed-by: Lynne <dev@lynne.ee> Reviewed-by: Anton Khirnov <anton@khirnov.net> Tested-by: Wenbin Chen <wenbin.chen@intel.com> Signed-off-by: softworkz <softworkz@hotmail.com> Signed-off-by: Haihao Xiang <haihao.xiang@intel.com>	2022-01-05 11:05:06 +08:00
Andreas Rheinhardt	b189550137	lib*/version.h: Bump Versions after release/5.0 branch This is done a second time for 5.0 because master was merged into 5.0 so that it contains the recent DOVI additions. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-01-04 14:29:06 +01:00
Andreas Rheinhardt	c512be9a90	lib*/version.h: Bump Versions before release/5.0 branch Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-01-04 13:40:03 +01:00
Niklas Haas	78dc21b123	lavu/frame: Add Dolby Vision metadata side data type In order to be able to extend this struct later (as the Dolby Vision RPU evolves), all of the 'container' structs are considered extensible, and the individual constituent fields must instead be accessed via offsets. The precedent for this style of access is set in <libavutil/detection_bbox.h> Signed-off-by: Niklas Haas <git@haasn.dev> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-01-04 11:59:02 +01:00
Michael Niedermayer	4be85c9331	lib*/version.h: Bump Versions after release/5.0 branch Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2022-01-03 22:10:46 +01:00
Michael Niedermayer	f3964a59e1	lib*/version.h: Bump Versions before release/5.0 branch Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2022-01-03 22:08:31 +01:00
Lynne	b84ce56589	hwcontext_vulkan: remove VK_EXT_hdr_metadata from autoloaded extensions list We don't use it. Was copied from libplacebo's recommended defaults. Creates problems with validation on Intel devices, where the driver still advertizes it, even though it's not usable without a swapchain.	2022-01-03 03:16:45 +01:00
Wenbin Chen	ed6c5c13b1	libavutil/hwcontext_qsv: clean padding when upload qsv frames Fix #7830 When we upload a frame that is not padded as MSDK requires, we create a new AVFrame to copy data. The frame's padding data is uninitialized so it brings run to run problem. For example, If we run the following command serveral times we will get different outputs. ffmpeg -init_hw_device qsv=qsv:hw -qsv_device /dev/dri/renderD128 \ -filter_hw_device qsv -f rawvideo -s 192x200 -pix_fmt p010 \ -i 192x200_P010.yuv -vf "format=nv12,hwupload=extra_hw_frames=16" \ -c:v hevc_qsv output.265 According to https://github.com/Intel-Media-SDK/MediaSDK/blob/master/doc/mediasdk-man.md#encoding-procedures "Note: It is the application's responsibility to fill pixels outside of crop window when it is smaller than frame to be encoded. Especially in cases when crops are not aligned to minimum coding block size (16 for AVC, 8 for HEVC and VP9)" I add a function to fill padding area with border pixel to fix this run2run problem, and also move the new AVFrame to global structure to reduce redundant allocation operation to increase preformance. Signed-off-by: Wenbin Chen <wenbin.chen@intel.com> Signed-off-by: Haihao Xiang <haihao.xiang@intel.com>	2021-12-23 15:49:07 +08:00
rcombs	5afc5661ac	lavu/hwcontext_videotoolbox: use OS-provided mapping routines when available	2021-12-22 18:43:34 -06:00
rcombs	b7e1ec7bda	lavu/videotoolbox: expose routine to set CVPixelBufferRef metadata	2021-12-22 18:43:17 -06:00
rcombs	69bd95dcd8	lavu/videotoolbox: expose conversion routines for color parameters Also fixes symbol lookup errors on older macOS when built with a newer SDK, introduced in `6cab5206b0`	2021-12-22 18:42:51 -06:00
James Almer	e1d3ef9217	avutil/tests/cpu: add slowgather Signed-off-by: James Almer <jamrial@gmail.com>	2021-12-21 17:52:09 -03:00
James Almer	e68e379e0c	avutil/cpu: add slowgather to av_parse_cpu_caps() Signed-off-by: James Almer <jamrial@gmail.com>	2021-12-21 17:51:27 -03:00
James Almer	8c2d2fd6cc	avutil/cpu: move slow gather checks below in the function Put them together with other similar slow flag checks. Signed-off-by: James Almer <jamrial@gmail.com>	2021-12-21 17:51:17 -03:00
Alan Kelly	ffbab99f2c	libavutil/cpu: Add AV_CPU_FLAG_SLOW_GATHER. This flag is set on Haswell and earlier and all AMD cpus.	2021-12-21 17:44:44 -03:00

1 2 3 4 5 ...

5434 Commits