Print — Iraqi Digital Repository

ضغط الصوت باستخدام الكسوريات == Fractal Audio Compression

Author name: وسام فوزي جاسم محمد

Supervisor name: لؤي ادور جورج | ليث عبد العزيز العاني

General topic: Physics

Specific topic: Physics

Degree: Master

University: Al-Nahrain University - College Of Science - Physics Department

Language: English

University location: Baghdad

First pages: 26T1711 - p.pdf

Abstract: لقد استحدثت طرق ضغط عديدة باستخدام تقنيات مختلفة الغرض منها تحقيق نسب ضغط عالية مع المحافظة على جودة الصوت المضغوط, مع الاخذ بنظر الاعتبار انجاز الضغط باقل وقت ممكن.طريقة الضغط الكسوري هي احدى هذه الطرق التي يرافقها ضياع بالبيانات وتعتبر من طرق الضغط الحديثة وتعتمد على عملية ايجاد مجاميع محددة وصغيرة من المعادلات الرياضية التي تصف الاشارة . بارسال معاملات هذه المعادلات الى محلل الشفرة يمكن اعادة تشكيل الاشارة الاصلية.بايجاز ضغط الصوت الكسوري يعتمد على مبدا نظام التكرار المجزئPIFS) ). ضغط الصوت الكسوري يستغل ال(Affine redundancy) والتي تتواجد في الصوت, هذه ال (redundancy) ترتبط بالتشابه الذاتي للصوت.بكلمة اخرى ضغط الصوت الكسوري يقوم بايجادالا نماط المتشابهة والتي تتواجد بقياسات مختلفة(different scales) واماكن مختلفة(different places) في الصوت, وبعد ذللك يقوم بحذف اكثر ما يمكن من ال(redundancy).هذا البحث يهدف الى محاولة التحقق من امكانية تطبيق الضغط الكسوري في عملية ضغط الصوت. النظام المعتمد يتكون من مرحلتين : المرحلة الاولى مرحلة التشفير(Encoding Unit) والثانية مرحلة فك التشفير ) .(Decoding Unitفي مرحلة التشفير يجزا الصوت الاصلي الى نوعين من الكتل, كتل تدعى كتل المجال المقابل ((Range Pool وهي كتل غير متداخلة, وكتل تدعى كتل المجال (Domain Pool) والتي من الممكن ان تكون متداخلة. ومن ثم يتم تقطيع الصوت باستخدام طريقة التقطيع المتساوي للكتل الى كتل متساوية الحجم. بعد ذلك يتم ايجاد افضل كتل في المجال لكل كتلة في المجال المقابل وذلك بتطبيق احد انواع التحويلات وتدعى((Affine Transformation .تنتهي مرحلة التشفير بخزن تفاصيل (معاملات )هذه التحويلات لكل كتلة من كتل المجال المقابل .ان عملية ايجاد الكتل المتشابهة تتطلب عمليات حسابية معقدة تستغرق وقت طويل وهذا ما يؤخذ على طريقة الضغط الكسوري كنقطة ضعف.ان البرامجيات التي تم بناؤها لهذا الغرض تم اختبارها باستخدام خمسة نماذج لبيانات صوتية. لقد تم تنفيذ النظام المعتمد باستخدام فيجوال بيسك6.0 كلغة برمجة, تم استخدام معدل الخطا التربيعي((MSE ونسبة الاشارة الى التشويش((PSNR كمعاملات حساب دقة النتائج المستخلصة من التقنية الكلية في النظام المعتمد.و كانت بعض النتائج المستخلصة من البحث هي بالنسبة للعينة الاولى في حالة حجم الكتلة تساوي (60 ) فان نسبة الضغط (15.51 : 1), اما العينة الثانية في حالة حجم الكتلة تساوي (40) فان نسبة الضغط (11.51 : 1), وفي حالة حجم كتلة صغيرة كما في العينة الثالثة والتي تساوي (20) فان قيمة ال(PSNR) جيدة وتساوي (35.76 dB), وكذالك للعينة الاولى في حالة حجم الكتلة تساوي (10) فان قيمة ال(PSNR) تساوي (45.63 dB). | Fractal audio compression is based on the concept of a partitioned iterated function system (PIFS). Fractal audio compression exploits the affine redundancy that is commonly present in audio; this redundancy is related to the similarity of an audio with itself. In other words, fractal audio compression finds similar patterns that exist in different scales and different places in audio, and then eliminates as much redundancy as possible. In this work the possibility of implementing fractal audio compression is investigated. The implemented system consists of two major units; the first is the Encoding unit and the second is the Decoding unit. Encoding is done by partitioning the range pool (which is the original audio) into non - overlapping blocks, called range blocks, and partitioning the domain pool (which is the result of the original audio after down sampling) into overlapped blocks with the same size of range blocks called domain blocks. A fixed size - partitioning scheme is used to partition the domain pool and the range pool. After generating the range and domain pools, for every range block, the best - matching domain block in the domain pool is searched for by performing a set of affine transformations on them. Thus the encoding is completed by saving the optimal affine parameters for every range block. The Decoding process can be done by repeatedly applying the affine transformation on an initially blank audio and its subsequent reconstructed audio, until it completely reconstruct an approximate wave to the original audio. The time required to compress an audio file is affected by the size of each block being extracted from the proceed audio file; this means smaller block size implies longer time required to compress the corresponding audio file.The implemented system was tested using five wave samples of data. The proposed work was implemented by using Visual Basic (6.0) as a programming language, the fidelity measure MSE and PSNR were used to check the results of the whole implemented technique. The best results obtained from the implemented system were for the test (sample - 1) in case were block size equal to (60) the compression ratio is (15.51 : 1), also for test (sample - 2) were block size equal to (40) the compression ratio is (11.03 : 1), and for small block size as in test (sample - 3) were the block size is (20) the value of the PSNR is good that its equal to (35.76 dB), also for test sample - 1 when the block size is (10) the PSNR value (45.63 dB).