mirror of https://git.ffmpeg.org/ffmpeg.git synced 2024-09-21 13:56:55 +00:00

Go to file

Guo, Yejun 37d24a6c8f vf_dnn_processing: add support for more formats gray8 and grayf32 The following is a python script to halve the value of the gray image. It demos how to setup and execute dnn model with python+tensorflow. It also generates .pb file which will be used by ffmpeg. import tensorflow as tf import numpy as np from skimage import color from skimage import io in_img = io.imread('input.jpg') in_img = color.rgb2gray(in_img) io.imsave('ori_gray.jpg', np.squeeze(in_img)) in_data = np.expand_dims(in_img, axis=0) in_data = np.expand_dims(in_data, axis=3) filter_data = np.array([0.5]).reshape(1,1,1,1).astype(np.float32) filter = tf.Variable(filter_data) x = tf.placeholder(tf.float32, shape=[1, None, None, 1], name='dnn_in') y = tf.nn.conv2d(x, filter, strides=[1, 1, 1, 1], padding='VALID', name='dnn_out') sess=tf.Session() sess.run(tf.global_variables_initializer()) graph_def = tf.graph_util.convert_variables_to_constants(sess, sess.graph_def, ['dnn_out']) tf.train.write_graph(graph_def, '.', 'halve_gray_float.pb', as_text=False) print("halve_gray_float.pb generated, please use \ path_to_ffmpeg/tools/python/convert.py to generate halve_gray_float.model\n") output = sess.run(y, feed_dict={x: in_data}) output = output * 255.0 output = output.astype(np.uint8) io.imsave("out.jpg", np.squeeze(output)) To do the same thing with ffmpeg: - generate halve_gray_float.pb with the above script - generate halve_gray_float.model with tools/python/convert.py - try with following commands ./ffmpeg -i input.jpg -vf format=grayf32,dnn_processing=model=halve_gray_float.model:input=dnn_in:output=dnn_out:dnn_backend=native out.native.png ./ffmpeg -i input.jpg -vf format=grayf32,dnn_processing=model=halve_gray_float.pb:input=dnn_in:output=dnn_out:dnn_backend=tensorflow out.tf.png Signed-off-by: Guo, Yejun <yejun.guo@intel.com> Signed-off-by: Pedro Arthur <bygrandao@gmail.com>		2020-01-07 10:51:38 -03:00
compat	compat/avisynth: Fix unicode compilation.	2020-01-01 16:38:28 +01:00
doc	vf_dnn_processing: add support for more formats gray8 and grayf32	2020-01-07 10:51:38 -03:00
ffbuild	build: add support for building CUDA files with clang	2019-08-04 19:08:08 +02:00
fftools	ffmpeg: don't force source-tracked keyframes for duplicates	2020-01-05 12:01:46 +05:30
libavcodec	avcodec/libvpxenc,cosmetics: prefer sizeof(var)	2020-01-06 22:35:38 -08:00
libavdevice	Silence "string-plus-int" warning shown by clang.	2020-01-06 22:38:56 +01:00
libavfilter	vf_dnn_processing: add support for more formats gray8 and grayf32	2020-01-07 10:51:38 -03:00
libavformat	avformat/aviobuf: Honor avio_open[2] documentation	2020-01-07 02:44:05 +01:00
libavresample	avresample: remove deprecated attribute from the AVAudioResampleContext struct	2018-01-09 10:56:53 -03:00
libavutil	Silence "string-plus-int" warning shown by clang.	2020-01-06 22:38:56 +01:00
libpostproc	Silence "string-plus-int" warning shown by clang.	2020-01-06 22:38:56 +01:00
libswresample	Silence "string-plus-int" warning shown by clang.	2020-01-06 22:38:56 +01:00
libswscale	Silence "string-plus-int" warning shown by clang.	2020-01-06 22:38:56 +01:00
presets
tests	avformat/matroskadec: Fix lzo decompression	2019-12-28 22:40:13 -03:00
tools	tools/target_dec_fuzzer: Stop negative block_align and sampling rate	2019-12-31 12:37:47 +01:00
.gitattributes
.gitignore	tools/python: add script to convert TensorFlow model (.pb) to native model (.model)	2019-07-01 10:23:47 -03:00
.travis.yml	Merge commit '899ee03088d55152a48830df0899887f055da1de'	2019-03-14 15:53:16 -03:00
Changelog	avfilter: add thistogram video filter	2019-12-29 15:33:55 +01:00
configure	configure: Change the configure check for tonemap_vaapi	2020-01-07 00:04:50 +00:00
CONTRIBUTING.md
COPYING.GPLv2
COPYING.GPLv3
COPYING.LGPLv2.1
COPYING.LGPLv3
CREDITS
INSTALL.md	INSTALL.md: Fix Markdown formatting	2019-01-31 10:29:16 -09:00
LICENSE.md	avfilter/vf_geq: Relicense to LGPL	2019-12-28 11:20:48 +01:00
MAINTAINERS	MAINTAINERS: add myself as libxavs2 maintainer	2019-12-07 19:34:57 +01:00
Makefile	tools: add a fuzzer tool for bitstream filters	2019-12-05 20:49:15 -03:00
README.md
RELEASE	RELEASE: Update value for post 4.2 branch	2019-07-21 18:37:32 +02:00

README.md

FFmpeg README

FFmpeg is a collection of libraries and tools to process multimedia content such as audio, video, subtitles and related metadata.

Libraries

libavcodec provides implementation of a wider range of codecs.
libavformat implements streaming protocols, container formats and basic I/O access.
libavutil includes hashers, decompressors and miscellaneous utility functions.
libavfilter provides a mean to alter decoded Audio and Video through chain of filters.
libavdevice provides an abstraction to access capture and playback devices.
libswresample implements audio mixing and resampling routines.
libswscale implements color conversion and scaling routines.

Tools

ffmpeg is a command line toolbox to manipulate, convert and stream multimedia content.
ffplay is a minimalistic multimedia player.
ffprobe is a simple analysis tool to inspect multimedia content.
Additional small tools such as aviocat, ismindex and qt-faststart.

Documentation

The offline documentation is available in the doc/ directory.

The online documentation is available in the main website and in the wiki.

Examples

Coding examples are available in the doc/examples directory.

License

FFmpeg codebase is mainly LGPL-licensed with optional components licensed under GPL. Please refer to the LICENSE file for detailed information.

Contributing

Patches should be submitted to the ffmpeg-devel mailing list using git format-patch or git send-email. Github pull requests should be avoided because they are not part of our review process and will be ignored.