FFmpeg / Libav Summer Of Code 2011: Difference between revisions

From MultimediaWiki
Jump to navigation Jump to search
(Move up DTS encoder to 1st Tier Project)
Line 39: Line 39:
''Mentor: [[User:Suxen_drol|Peter Ross]]
''Mentor: [[User:Suxen_drol|Peter Ross]]


=== DTS Encoder (Qualification claimed by Xiang Wang) ===
=== DTS Encoder ===
* Specification: http://wiki.multimedia.cx/index.php?title=Mirror
* Specification: http://wiki.multimedia.cx/index.php?title=Mirror
* Sample files: http://samples.mplayerhq.hu/A-codecs/DTS/
* Sample files: http://samples.mplayerhq.hu/A-codecs/DTS/
Line 47: Line 47:
* Third goal: Advanced psycho acoustics.
* Third goal: Advanced psycho acoustics.


(Qualification claimed by Xiang Wang)
''Mentor: Benjamin Larsson''
''Mentor: Benjamin Larsson''



Revision as of 07:35, 14 March 2011

Qualification tasks

For us to consider your application for SoC we require a completed qualification task. Many Summer-of-Code projects (in the list below) have specific qualification tasks. These tasks are meant to make you familiar with the code that you will be working with, are at approximately the same difficulty level as the actual Summer-of-Code project itself (just a lot smaller), and often already provide you with a jumpstart into your Summer-of-Code project. We suggest the following order of events:

  • First, select a Summer-of-Code project (either from the list below, but in some cases you may also come up with your own)
  • Second, discuss this project with the person that will mentor it. If a mentor is listed, talk to him on IRC, via email or so. If no mentor is listed, find one by emailing the FFmpeg-soc mailinglist.
  • With your mentor, discuss the most appropriate qualification task for your Summer-of-Code project.

If no specific qualification task is listed for your project of interest, you can discuss with your mentor to choose a task from the Small Tasks list or the Interesting Patches list instead. If your prospective mentor agrees, please send an email to the FFmpeg-soc mailing list to inform that you are working on it (to avoid duplicated work). You can discuss progress on your qualification task and initial review of your finished qualification task on the FFmpeg-soc mailinglist (sometimes the patch will have to go through final review on the main FFmpeg development mailinglist before it can be applied also). The qualification task is considered completed when your patch is accepted to our main SVN tree.

Before posting to the FFmpeg-soc mailinglist, make sure you read and understand our netiquette guidelines, especially avoid top-posting and thread-hijacking (note that if you don't understand one of those terms, make sure to have understood them before writing your first post). Before you send us your patch, read our development guidelines and make sure your patch fulfills all the requirements stated there. You should also familiar with the programs diff, patch and Subversion. You have to learn these basics on your own before you start, we will not teach them to you during the application process.

A completed FFmpeg qualifications task is also accepted as a qualification task for the VLC organization (does NOT include x264).

1st Tier Project Proposals

1st tier project proposals are project ideas that are reasonably well defined AND have a mentor volunteered.

BSAC AAC Decoder

  • Separate AAC Huffman decoding and dequantization from the rest of AAC decoder.
  • Write a BSAC AAC decoder.
  • Reuse as much of the existing AAC decoder and libavcodec as possible.

Mentor: Alex Converse, task claimed by Young Han Lee (qualification task: AAC LTP profile)

VC-1 decoder missing features

  • Add support for interlaced streams as used in Bluray recordings to the VC-1 decoder.
  • Add support for chroma/luma scaling, used in RTP/VC1.
  • Add support for slice decoding, used in RTP/VC1.
  • Add optimizations for x86-32/64 (or ARM), so that HD streams play back faster.
  • This includes fixing some reference streams

Mentor: Kostya Shishkov, Ronald S. Bultje and (maybe) Christophe Gisquet

Windows Television (WTV) muxer

  • Qualification task: develop a prototype WTV muxer that can produce files then able to be demuxed by FFmpeg.
  • The objective of this project is produce files that are compatible with Microsoft Windows Media Center. To achieve this, one must determine which of the many undocumented data types are expected by Microsoft Stream Buffer Engine, discover their format, and implement these in the muxer.
  • The muxer must support metadata and indexing.
  • Clearly defined task; reverse engineering skills required.

Mentor: Peter Ross

DTS Encoder

  • Specification: http://wiki.multimedia.cx/index.php?title=Mirror
  • Sample files: http://samples.mplayerhq.hu/A-codecs/DTS/
  • Qualification: Extend the encoder in the SoC tree (port the float transform from the decoder, fix the LFE channel generation).
  • Primary goal: Encoder that can produce multi sample rate, multi channel files and multi bit-rate. Wav and raw muxing support.
  • Secondary goal: Techniques from specification implemented, optimal codebook usage, vector quantization, simple psycho acoustics.
  • Third goal: Advanced psycho acoustics.

(Qualification claimed by Xiang Wang) Mentor: Benjamin Larsson

1st tier proposals of last year

These are the same as above-mentioned 1st tier proposals, but we're not sure yet if we'll repeat them. Once mentors are OK, please move them up.

NEW Seeking API

  • primary goal: implement a new seeking API in libavformat
    • implement av_seek_file in libavformat
    • implement compatible new seek_file for all AVInputFormat porting existing seek function if possible.
    • implement av_build_index function which will build an AVIndex for the file
    • implement av_export_index function which will save AVIndex in a file which can be loaded later.

Mentor: Baptiste Coudurier

AACS implementation

  • this task probably is to get libbluray integrated with vlc/mplayer/ffmpeg and working with as many discs as possible.
  • Add the ability to encode and decode using Advanced Access Content System to FFmpeg.
  • Specifications: http://www.aacsla.com/specifications/
  • existing implementation e.g. DumpHD: http://forum.doom9.org/showthread.php?t=123111
  • Most parts (BD-J, MKB, title key generation) probably do not belong into FFmpeg, this should be discussed with us before submitting an application
    • possible solution: only implement "lowest" level (decode given the correct title key) but implement CSS en- and decryption as secondary goal

Mentor: Reimar Döffinger

Libavfilter video work

Libavfilter is the FFmpeg filtering library that started as a 2007 SoC project. It replaced the now removed vhook subsystem. Most of it is already part of the FFmpeg main source tree, but there a few bits remaining. This project would consist in a combination of the following tasks:

  • Get the remaining bits of the SoC tree committed (this includes: the rotate, split, and fps filters)
  • Port all the missing filters from MPlayer (do not forget asking the authors if it is ok to release them under the LGPL)
  • Framework: implement dynamic-reconfiguration of the filterchain, for supporting dynamic size/format changes
  • Port filters from other frameworks (yuvjpeg, effectv, frei0r, virtualdub, vlc, etc...)
  • Write wrappers for more image processing libraries and filtering frameworks (libgimp, libgraphicsmagic, weed), extend the libopencv wrapper for supporting more filters (this may need implemented float image support in libswscale)
  • Write more filters (possibly starting from already posted filters which for a reason or another were never committed, e.g. concatenate, fish, 2xsai, select, lut, eval, posterize, elbg/posterize etc.)
  • Implement a compose filter (suggestion for a better name?), for creating a mosaic of the input video streams
  • Write a movie sink (e.g. it could be useful for implementing a display functionality, e.g. when the video is dumped to an output device)
  • Write a frequency domain filter using the FFT implementation in libavcodec
  • Write a Matlab/Octave/SAGE scripting wrapper (assuming it can be done)
  • More ideas?

Mentor: To be determined

Libavfilter audio work

At the moment, FFmpeg filtering library has incomplete support for handling audio. This task would consist of a combination of the following tasks:

  • Fix the remaining outstanding issues and complete the audio framework integration (check the saste's audio-filters FFmpeg branch on gitorious)
  • Make the resampling filter works for several combinations of sample format and channels
  • Write a visualization filter as proof-of-concept of a filter that works with both video and audio
  • Extend/improve libav* audio resampling/rematrixing/requantization capabilities (this may need to re-design the audio API, maybe it deserves a task on its own)
  • Write/port basic audio filters (check libaf in MPlayer/libmpcodecs)
  • Write wrapper for audio frameworks (sox, ladspa, ...)

Mentor: To be determined

Rewrite the audio format conversion code

  • right now, we're using audioresample to resample audio (samplerate / channels) and audioconvert to resample audio format (int, float, 16-bit, 32-bit). We'd like a new, swscale-style implementation that combines these two in a single codebase
  • design should allow for SIMD optimization of popular conversions (float-int16, int16-float)
  • fix bugs in current design (clipping, overflows when going from float to int, questionable rounding)

Mentor: to be decided


h264 decoder optimizations

  • Quite simply, make our decoder faster
  • Qualification task: make our decoder 1% faster

Mentor: Michael Niedermayer

Add support for Bayer RGB colour format

Since it is not even clear if this should be implemented in libswscale or libavcodec, this should be discussed on ffmpeg-devel or ffmpeg-soc before submitting. There was a related discussion on mplayer-devel once and at least two related FFmpeg issues:

Mentor: Michael Niedermayer

2nd Tier Project Proposals

All that separates these proposals from their 1st tier brethren is a lack of mentor.

Finish SoC projects from previous years

Some projects are lingering in the dark unfinished. They should be picked up and made ready for inclusion. These projects are potentially less involved than starting from scratch, but also more useful for FFmpeg since the probability that the projects get finished should be higher. If some of them are deemed too easy, they could be combined.

Unfinished projects from previous years are:

2007:

2008:

2009:

For the current status of all SoC projects up to date, see FFmpeg Summer Of Code.

Implement a Windows Video Presentation 2 decoder

RALF Realaudio Lossless

  • RE and implement a decoder for this format
  • Reuse as much libavcodec code as possible

libvo

  • Port MPlayer's libvo to ffplay
  • Note that this does not just mean to produce a working hack so that ffplay can use xv, but a clean and acceptable wrapper for (most of) libvo.
  • An alternative might be to only port the OpenGL part of libvo into a new glplay. This should be discussed on ffmpeg-devel before submission.

Speex Decoder

Also see Speex.

CELT Decoder

Also see CELT.

AMR-NB Encoder

Also see AMR.

VP6 Encoder

WMV3 Encoder

  • Clearly defined task
  • Primary goal: Encode video sequences such that they can be decoded by a Windows Media player.

This could either be done by improving this patch or by writing the encoder from scratch.

Improve subtitle support

  • Add text-to-bitmap conversion functions
  • One with hard-coded bitmaps for characters
  • One that utilizes freetype
  • Function used will be chosen upon compilation

Adjust existing subtitle support to new ABI

Improve Ratecontrol

  • Primary goal 1: Fast heuristic VBV compliant per macroblock ratecontrol which has a better PSNR/bitrate and better subjective quality/bitrate than the current code.
  • Primary goal 2: VBV compliant, rate distortion optimal per macroblock ratecontrol using the viterbi algorithm.
  • Secondary goal 1: Fast heuristic scene change detection which detects scene changes more accurately, has better PSNR/bitrate and subjective quality/bitrate than the current heuristic.
  • Secondary goal 2: Rate distortion optimal (for the current picture) scene change detection.
  • Secondary goal 3: B frames decision which is faster and or has a higher PSNR/bitrate and subjective quality/bitrate than the current code.

SILK decoder and/or encoder

DTS-HD Master Audio decoder

  • You should know exactly what you do if you submit an application.

ATRAC3plus

MPEG-4 SLS

DirectShow capture

  • Implement an indev for DirectShow capture on Windows (and support A/V synchronisation).
  • Qualification: Get GDI and waveform capture integrated into FFmpeg SVN with A/V synchronisation. Both patches are almost finished.