Selected Publications of the Department's Staff, Part 2

Application of Algorithms Dealing with Time Domain Uncertainty for the Automatic Recognition of Musical Phrases

Kostek Bożena and Szczerba Marek. 102nd Convention of the AES, Preprint No. 4502 (N7), Munich, Germany, March 22-25, 1997.

Recognition of musical phrases requires a specific approach because of the specific structure of the phenomenon. Because various modifications can be applied to a phrase, patterns still perceived by a human listener as one melody can differ in the time and frequency domains. Parametrization can reduce the influence of such modifications; however, some of them can still mislead the recognition process. To improve recognition results, a system based on the dynamic time warping algorithm has been proposed. Results of preliminary tests are presented. General conclusions concerning the proposed method of automatic recognition of musical phrases are derived and presented.
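As an illustration only (the preprint's own system is not reproduced here), the textbook dynamic time warping distance can be sketched in Python as follows. The function name dtw_distance, the use of MIDI pitch numbers as the sequence elements, and the example phrases are assumptions made for this sketch.

```python
def dtw_distance(a, b):
    """Classic dynamic time warping distance between two numeric sequences."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])  # local distance between two elements
            cost[i][j] = d + min(cost[i - 1][j],      # a advances, b waits
                                 cost[i][j - 1],      # b advances, a waits
                                 cost[i - 1][j - 1])  # both advance
    return cost[n][m]


# Hypothetical phrases given as MIDI pitch numbers, one value per time step.
phrase = [60, 62, 64, 65, 67]
stretched = [60, 62, 62, 62, 64, 65, 67]   # same melody, second note held longer
other = [60, 59, 57, 55, 53]               # a different melody

same_melody_cost = dtw_distance(phrase, stretched)
different_melody_cost = dtw_distance(phrase, other)
```

In this sketch the time-stretched variant aligns with the original at zero cost, while the different melody does not, which is the kind of tolerance to timing changes the abstract refers to.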

Application of Chebychev Polynomials to Calculation of the Nonlinear Characteristics of the Digital Waveguide Model of the Organ Pipe

Zieliński Sławomir and Szwoch Grzegorz. 102nd Audio Engineering Society Convention, Preprint No. 4499 (N4), Munich, Germany, March 22-25, 1997

Digital waveguide models of organ pipes can serve as a basis for real-time sound synthesis algorithms. A new technique for estimating the nonlinear function that models the interaction between the air jet and the resonator of the pipe is proposed. Problems related to applying this technique to match desired spectra of organ pipe sounds are discussed.
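The link between Chebyshev polynomials and spectra can be illustrated with the well-known identity T_k(cos θ) = cos(kθ): a memoryless nonlinearity built as a weighted sum of T_k, driven by a unit-amplitude cosine, produces harmonic k with the chosen weight. The Python sketch below demonstrates only this general property; the target amplitudes and all function names are assumptions, and it is not the estimation procedure of the preprint.

```python
import math


def chebyshev_T(n, x):
    """Evaluate the Chebyshev polynomial of the first kind T_n(x) by recurrence."""
    t_prev, t_curr = 1.0, x
    if n == 0:
        return t_prev
    for _ in range(n - 1):
        t_prev, t_curr = t_curr, 2.0 * x * t_curr - t_prev
    return t_curr


def shaper(x, harmonic_amps):
    """Nonlinear characteristic f(x) = sum_k h_k * T_k(x), with k starting at 1."""
    return sum(h * chebyshev_T(k, x) for k, h in enumerate(harmonic_amps, start=1))


def harmonic_amplitude(signal, k):
    """Amplitude of the k-th cosine harmonic of exactly one period of a signal."""
    n = len(signal)
    return 2.0 / n * sum(s * math.cos(2.0 * math.pi * k * i / n)
                         for i, s in enumerate(signal))


target = [1.0, 0.5, 0.25]   # hypothetical amplitudes of harmonics 1..3
N = 64                      # samples in one period of the driving cosine
out = [shaper(math.cos(2.0 * math.pi * i / N), target) for i in range(N)]
measured = [harmonic_amplitude(out, k) for k in (1, 2, 3)]
```

Here measured reproduces target up to rounding error, because each T_k maps the unit cosine onto its k-th harmonic.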

Application of Neural Networks to the Recognition of Musical Sounds

Kostek Bożena and Królikowski Rafał. Archives of Acoustics, vol. 22, No. 1, 1997, pp. 27-50.

The aim of the presented work was to train a neural network to recognize the class of a chosen musical instrument. Because problems of sound analysis are tied to subjective human perception, tools such as neural networks seem well suited to recognition tasks. However, an artificial neural network cannot be trained directly on consecutive samples of a sound, so a feature extraction procedure is needed first. Unfortunately, there is no consensus regarding the selection of methods for feature vector extraction, and several approaches to this task exist. Some are based on source-signal relationships; alternatively, an arbitrary choice of sound signal parameters is possible. In the latter case, a set of parameters extracted from both the time and frequency domains is created. The experiment aimed to check whether the calculated parameters are sufficient for creating a set of sound patterns used for neural network training. Several neural networks were investigated in the experiment; they were trained with the so-called ELEVEN and FOURTEEN vector types. After the learning procedure was completed, other examples from the previously created database (not seen by the network during training) were presented to the networks. The results show that neural networks (NNs) are able to generalize the information included in feature vectors. However, using an NN as a decision algorithm has both advantages and disadvantages. The main disadvantage of recognizing musical patterns with an NN is the time consumed by the learning phase. On the other hand, when data are presented to the NN inputs, variation of parameters within the data, and therefore data clustering, poses no problem, because an NN generalizes information during the learning phase. The paper analyzes the experimental results and presents conclusions derived from the tests performed.

Artificial Neural Network As a Classifier of Musical Instrument Sounds

Kostek Bożena and Królikowski Rafał. Proceedings of the 5th European Congress EUFIT97, Aachen, Germany, September 8-12, 1997.

The rationale of this work was to develop an artificial neural network for musical instrument sound classification. For this purpose, feed-forward networks using the error back-propagation (EBP) algorithm and the delta learning rule were applied. Tasks related to the creation of feature vectors consisting of musical sound parameters were briefly reviewed. For selected phases of the training process, dynamic changes of the training parameters were presented graphically. Additionally, the relation between the number of iterations and the maximum admissible value of the cumulative cycle error was shown. The effectiveness of the network in identifying new objects in the testing phase was presented. Conclusions were also included.

Automatic Reasoning about Acoustic Data Problems with Preprocessing, Classification and Decision Uncertainty

Kostek Bożena. Procedural Conference of the Intelligent Data Analysis (IDA-95), Baden-Baden, Germany, August 17-19, 1995. Print: [Proc.] The International Institute for Advanced Studies in Systems Research and Cybernetics, Vol.1, pp. 99-103.

There are at least three areas of application of artificial intelligence algorithms in acoustics. The first concerns the analysis of musical sound in order to determine which group of musical instruments the analyzed signal belongs to. The second concerns the analysis of sounds in order to determine which musical phrase, melody or even particular musical piece the analyzed signals represent. Such analyses might support decisions in the automatic recognition of musical style. The third concerns acoustic data resulting from subjective testing procedures. This methodology is often used in acoustics because subjective judgments influence the perceived quality of sounds. The main purpose of this paper is to discuss problems connected with the preprocessing and classification of acoustic data with regard to subsequent decision making. As it is impossible to illustrate all of the above topics in this paper, these problems are shown on the basis of an exemplary experiment related to musical instrument timbre recognition.

Digital Waveguide Modeling of the Organ Flue Pipe

Zieliński Sławomir. Proceedings of the 19th Tonmeistertagung, Stadthalle Karlsruhe, Germany, November 15-18, 1996.

The digital waveguide model of the organ flue pipe was developed. This model allows one to synthesize realistic sounds of a flue pipe, including transient states. Moreover, it is simple enough to be run in real-time on a single digital signal processor (DSP).

Feature Extraction Methods for the Intelligent Processing of Musical Signals

Kostek Bożena. 99th Convention of the AES, Preprint No. 4076 (H4), New York, NY, USA, October 6-9, 1995.

The purpose of the study was to find appropriate sound parameters for feeding the inputs of decision algorithms, such as those based on neural networks or rough sets. The quality of the chosen parameters was tested statistically and with the use of a neural network algorithm. Experimental results and conclusions are shown in the paper. Conclusions on the artificial intelligence approach to the automatic recognition of musical timbre were added.

Intelligent Analysis of Musical Databases

Kostek Bożena. 4th International Workshop on Rough Sets, Fuzzy Sets, and Machine Discovery (RSFD-96), Conference Materials, pp. 300-305, Tokyo, Japan, November 6-8, 1996.

A rough set-based analysis has been applied to musical databases. For this purpose, two exemplary musical databases were constructed. The first database, consisting of MIDI files, is based on Bach's fugues. The second database contained information on musical timbres. Problems connected with the construction of the databases and the preprocessing of parameters were discussed. Relationships between parameters included in the constructed databases were shown. A rough set-based system for the recognition of musical phrases was employed for the automatic classification of musical timbres. Experimental results were discussed and conclusions included.

MIDI Database for the Automatic Recognition of Musical Phrases

Kostek Bożena and Szczerba Marek. 100th Convention of the AES, Preprint No. 4169 (E-2), Copenhagen, Denmark, May 11-14, 1996.

 

Musical Databases. Construction and Analysis

Kostek Bożena, Szczerba Marek and Wieczorkowska Alicja. 19th Tonmeistertagung, Conference Materials, Stadthalle Karlsruhe, Germany, November 15-18, 1996.

 

Parametric Representation of Musical Phrases

Kostek Bożena and Szczerba Marek. 101st Convention of the AES, Preprint No. 4337 (D-3), Los Angeles, CA, USA, November 8-11, 1996.

The goal of this paper is to present a musical phrase in a simplified form in order to examine the relationships between its components. For that purpose, two different approaches to musical phrase analysis, namely the musicological and the MIDI code-based approach, were reviewed and applied in this work. Consequently, a database containing the calculated parameters was created. The quality of the applied methods was checked using the so-called analysis-by-synthesis approach, which made it possible to recreate the original phrase. Some general conclusions concerning automatic analysis of musical phrases were derived and presented.

Parametric Representation of Musical Sounds

Kostek Bożena and Wieczorkowska Alicja. Archives of Acoustics, Vol. 22, No. 1, 1997.

The rationale of this research work was to find appropriate sound parameters on the basis of which musical instrument sounds can be distinguished. A review of parameters used in musical acoustics was carried out, focusing on the frequency domain. Some of the parameters were extracted from sound representations. Then, the quality of the calculated parameters was tested statistically. Additionally, some discretization methods were applied in order to create so-called feature vectors for feeding the inputs of decision algorithms. Experimental results and conclusions are shown in the paper.

Rough Set Based Analysis of Computer Musical Storage

Kostek Bożena, Szczerba Marek and Czyżewski Andrzej. ICCIMA97, Brisbane, Australia, Month, 1997.

Two musical databases were constructed: the first one consisting of MIDI files based on Bach's fugues and the second one containing information on the timbres of musical instrument sounds. Some parametrization methods were introduced to represent features of musical phrases and of musical sounds. Relationships between parameters included in the constructed databases were studied. The rough set-based algorithm was employed for the task of automatic classification of musical patterns. Some experimental results were shown and discussed.

Rough Set-Based Analysis of Musical Databases

Kostek Bożena and Szczerba Marek. EUFIT96 Conference, Vol. 1, pp. 144-148, Aachen, Germany, September 2-5, 1996.

A comparison of various approaches to musical sound and musical phrase parametrization was presented. For that purpose, four databases were created. These databases consisted of objects related to musical timbre and musical phrase representations. A short review concerning the creation of the databases, along with algorithmic considerations, was presented. A rough set-based learning algorithm was applied in order to analyze the properties of the parametrized representations. Results of these tests were presented and conclusions were derived from the experiments.

Soft Set Approach to the Subjective Assessment of Sound Quality

Kostek Bożena. InterSymp97, Baden-Baden, Germany, Month, 1997.

An attempt was made to assess sound quality based on the soft set approach. For that purpose, techniques derived from rough set theory have been implemented. The most important notions of rough set theory have been reviewed. A short description of standard testing methods in acoustic sound quality evaluation is also included. Some exemplary data derived from subjective testing are presented and then processed using non-statistical methods. Conclusions concerning the applied approaches to the processing of subjective testing results are presented.
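Two of the basic notions of rough set theory, the lower and upper approximations of a set of objects, can be sketched in Python as follows. The decision table, the attribute names and the function name approximations are hypothetical and serve only as an illustration; they are not data from the paper.

```python
from collections import defaultdict


def approximations(table, attributes, target):
    """Return (lower, upper) rough-set approximations of the object set `target`.

    Objects with identical values on `attributes` are indiscernible and form
    one equivalence class.
    """
    classes = defaultdict(set)
    for obj, row in table.items():
        classes[tuple(row[a] for a in attributes)].add(obj)
    lower, upper = set(), set()
    for block in classes.values():
        if block <= target:   # class lies entirely inside the target set
            lower |= block
        if block & target:    # class overlaps the target set
            upper |= block
    return lower, upper


# Hypothetical listening-test table: each sample described by two attributes.
table = {
    "s1": {"brightness": "high", "noise": "low"},
    "s2": {"brightness": "high", "noise": "low"},
    "s3": {"brightness": "low", "noise": "low"},
    "s4": {"brightness": "low", "noise": "high"},
}
rated_good = {"s1", "s3"}   # samples the listeners judged as good quality

lower, upper = approximations(table, ["brightness", "noise"], rated_good)
```

In this example s1 and s2 are indiscernible but rated differently, so only s3 is certainly "good" (lower approximation), while s1, s2 and s3 are possibly "good" (upper approximation).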

Sound Quality Assessment Based on the Rough Set Classifier

Kostek Bożena. Proceedings of the 5th European Congress EUFIT97, Aachen, Germany, September 8-12, 1997.

The aim of this paper is to present the rough set-based approach to the processing of subjective test results. In such tests the quality of sound is evaluated by listeners, who give their opinion on the overall quality of the sound or assign values to the assessed sound attributes. In many acoustic domains there is a need to compare sound samples and, on this basis, to assess the quality of audio equipment, electroacoustic devices, room acoustics, and recently introduced low bit-rate compression algorithms. As there is no consensus on the features to be tested, the problem of assessing the significance of individual attributes may be solved through the soft computing approach. The results of subjective testing are usually gathered in decision tables containing sound attributes and experts' decisions. Therefore, the rough set method was found suitable for processing such data. Results of experiments that reveal hidden relations between sound attributes and the experts' overall decision will be presented in the final version of the paper.

Study of Parameter Relations in Musical Instrument Patterns

Kostek Bożena and Wieczorkowska Alicja. 100th Convention of the AES, Preprint No. 4173 (E-6), Copenhagen, Denmark, May 11-14, 1996.

 

Synthesis of Organ Pipe Sound Based on Physical Models

Czyżewski Andrzej, Kostek Bożena and Zieliński Sławomir. Archives of Acoustics, Vol. 21, No. 2, pp.131-147, 1996.

Problems related to the implementation of organ pipe sound synthesis based on physical models are discussed. A new approach to the physical modeling of organ pipe sound, namely waveguide synthesis, is introduced. Results of some experiments with this kind of synthesis are presented. Specific features of the presented methods and the corresponding applications are described. Examples of computer analysis of both synthesized and natural musical sounds are presented and compared.

A System for Musical Sound Parameter Database Creation and Analysis

Kostek Bożena and Wieczorkowska Alicja. 102nd Convention of the AES, Preprint No. 4498 (N3), Munich, Germany, March 22-25, 1997.

A concept of a system for creating databases of musical instrument sound parameters has been described. Real-valued databases of musical sound parameters have been discretized using a selection of discretization methods. The distribution of parameter values has been investigated and visualized. Rough set systems have been applied to check the importance of particular parameters. Conclusions concerning the discretization of real-valued parameters have been derived.
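The abstract does not name the discretization methods used. As one common example, equal-width binning of a real-valued parameter can be sketched in Python as follows; the function name, the parameter values and the number of bins are assumptions made for illustration.

```python
def equal_width_bins(values, n_bins):
    """Map each real value to a bin index 0..n_bins-1 using equal-width intervals."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values identical: everything falls into a single bin.
        return [0 for _ in values]
    width = (hi - lo) / n_bins
    # The maximum value would land in bin n_bins, so clamp it into the last bin.
    return [min(int((v - lo) / width), n_bins - 1) for v in values]


# Hypothetical values of one sound parameter for six instrument samples.
parameter = [0.0, 1.0, 2.5, 5.0, 7.5, 10.0]
labels = equal_width_bins(parameter, 4)
```

The resulting integer labels can then serve as attribute values in a decision table processed by a rough set system.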

Application of Fuzzy Logic and Rough Sets to Audio Signal Enhancement

Czyżewski Andrzej and Królikowski Rafał. Chapter 18 in "Rough Fuzzy Hybridization. A New Trend in Decision-Making", Springer-Verlag, Singapore, pp. 397-409, 1999.

A method of noise reduction, related to spectral subtraction and controlled by intelligent algorithms, is described in the paper. A decision system based on fuzzy logic and rough sets is presented. The engineered inference algorithm exploiting rough sets is also included.

Rough Set Analysis of Electrostimulation Test Database for the Prediction of Post-Operative Profits in Cochlear Implanted Patients

Czyżewski Andrzej, Skarżyński Henryk, Kostek Bożena and Królikowski Rafał. 7th Int. Workshop on Rough Sets, Fuzzy Sets, Data Mining, and Granular-Soft Computing, Ube, Yamaguchi, Japan, 1999.

A new method of examining the hearing nerve in deaf people is presented. It consists in testing deaf people with a speech signal delivered via a microelectrode connected to a current source and attached to the promontory. The current delivered to the electrode is modulated with the speech signal, transposed down the frequency scale. A database of patients' data and electrostimulation test results was created and analyzed using a rough set method in order to find rules allowing prediction of hearing recovery in cochlear implantation candidates.

Computational Approach to Spatial Filtering

Czyżewski Andrzej, Lasecki Jacek and Kostek Bożena. 7th European Congress on Intelligent Techniques and Soft Computing, Aachen, Germany, 1999.

Hearing impaired persons have difficulty understanding speech in cocktail-party conditions. Spatial filtering may be very helpful for such people. Since this feature should be built into the hearing aid, the computational complexity of a spatial filtering-based algorithm must allow real-time implementation. To meet this requirement, some investigations were made and a neural network-based algorithm was proposed. This algorithm is presented in the paper.

Investigating Polynomial Approximation of Spectra of the Pipe

Kaczmarek Andrzej, Czyżewski Andrzej and Kostek Bożena. Archives of Acoustics, vol. 24, No. 1.

A precise method for the determination of the spectral representation of pipe sounds was introduced. The polynomial approximation of the spectral envelope was found to be an effective tool, allowing the study of differences between sounds produced by organ pipes of various types belonging to some selected instruments. The paired comparison subjective testing procedure was applied in order to assess the similarities between sounds synthesized using polynomial smoothed spectra and the original organ sound patterns. The statistical processing of test results revealed that a direct relation exists between the type of organ pipe and the minimum order of the approximating polynomial that can be used to represent the pipe sound spectrum, as determined by the positive opinions of the experts. The applied pipe organ sound recording and processing methods, subjective testing procedures and experiment results are discussed in the paper.

Spatial Filtration of Sound for Multimedia Systems

Kostek Bożena, Czyżewski Andrzej and Lasecki Jacek. IEEE Signal Processing Society 1999 Workshop on Multimedia Signal Processing, Copenhagen, Denmark, 1999.

This paper deals with the problem of receiving a desired signal in noisy or "cocktail-party" conditions. This problem is vital in many domains, such as communications, multimedia (multimodal interaction), speech recognition, and psychoacoustics (hearing prostheses). It can be partially solved by classical filtering techniques; however, these techniques often introduce distortions into the filtered signal. On the other hand, experiments performed by the authors show that spatial filtration can be performed using an Artificial Neural Network (ANN). Such an algorithm was developed, and some details concerning its implementation are described. Moreover, results of experiments are presented. These results demonstrate that the ANN-based nonlinear filter increases the signal-to-noise ratio and improves speech intelligibility.

Multimedia Database of Musical Instrument Sounds

Kostek Bożena and Suchomski Piotr. 137th Acoustical Soc. of America Meeting, Berlin, 1999.

The presented paper addresses the broad problem of automatically recognizing musical instrument sounds. Many applications for the algorithms dealing with these tasks may be foreseen. Nowadays, with the rapid growth of electronic libraries and databases such as those found on the Internet, the possible application may be to search a musical database for the sounds of chosen instruments or for musical tunes. Therefore a multimedia database was prepared which serves as a source of data to be processed by some intelligent algorithms.

Multimedia Fitting System for Hearing Impaired People

Kostek Bożena, Czyżewski Andrzej and Suchomski Piotr. 3rd World Multiconference on Systemics, Cybernetics and Informatics (SCI'99) and the 5th International Conference on Information System Analysis and Synthesis (ISAS'99), Orlando, 1999.

One of the most important stages in the rehabilitation of hearing impaired people is the choice of an adequate hearing aid. The developed Multimedia Hearing Aid Fitting System (MHAFS) is experimental software that makes it possible to find the characteristics of a hearing aid matching the patient's needs and to choose a suitable hearing device automatically. It is planned that this system will be made available on the Internet, so that it can be used by anybody who wishes to try remote approximate testing of hearing characteristics and to hear sounds processed as in a well-fitted hearing aid. The key issues related to the engineered system will be presented in the paper.

Assessment of Concert Hall Acoustics Using Rough Set and Fuzzy Set Approach

Kostek Bożena. Chapter in Rough-Fuzzy Hybridization: A New Trend in Decision-Making. Pal S.K., Skowron A. (Eds.), Springer-Verlag. Singapore.

Many literature references already exist on how to correlate objective measurements with subjective impressions of an interior space, but there is not yet any consensus on this still unresolved acoustical subject. Recently, a novel approach to the computer assessment of acoustical quality has been proposed using soft computing. Rough set and fuzzy set theories were used to process both subjective evaluation and objective measurement results.

Noise Reduction in Telecommunication Channels Using Rough Sets and Neural Networks

Królikowski Rafał and Czyżewski Andrzej. 7th International Workshop on Rough Sets, Fuzzy Sets, Data Mining, and Granular-Soft Computing, Ube, Yamaguchi, Japan, 1999.

A new concept of reducing non-stationary noise affecting audio signals transmitted in telecommunication channels is proposed. This concept exploits some features of the human auditory system as well as some methods originating from the soft computing domain, i.e. rough set-based reasoning and neural processing. The foundations of the engineered method and a description of the applied decision algorithms are presented. A number of experiments have been prepared, and some of them have already been carried out. A brief discussion of the results of these experiments and some conclusions are also included.

Noise Reduction in Acoustic Signals Using the Perceptual Coding

Królikowski Rafał and Czyżewski Andrzej. 137th Regular Meeting of the Acoustical Society of America, Berlin, Germany, 1999

A new method of noise reduction exploiting some features of the human auditory system is proposed by the authors. The noise suppression is obtained in two ways: by uplifting masking thresholds and by keeping noisy components just beneath these thresholds. The foundations of the engineered method are described, and some results of the experiments carried out are briefly discussed in the paper.

Electrostimulation Tests as a Tool in Cochlear Implant Preoperative Diagnostics

Skarżyński Henryk, Czyżewski Andrzej and Kostek Bożena. 137 Acoust. Soc. of Amer. Meeting, Berlin, 1999

The procedures developed at the Institute of Physiology and Pathology of Hearing in Warsaw make it possible to determine some vital characteristics of the hearing sense that help in making decisions regarding cochlear implantation. Apart from standard pre-examination procedures, a test based on electrical stimulation via the external auditory canal filled with saline can be performed. In order to evaluate the test results, both the dynamic range, defined by the auditory threshold and the uncomfortable loudness level, and the Time Difference Limen Test are considered. Moreover, in some deaf patients speech communication was achieved using a ball-shaped electrode and spectral compression of the speech signal. In this way, the interpretation of the electrical stimulation test results for newly diagnosed cases was made more reliable.

Noise Reduction in Audio Employing Auditory Masking Approach

Czyżewski Andrzej and Królikowski Rafał. Proc. of the 106th AES Conv., Preprint 4930, Munich, Germany, 08-11 May, 1999.

A new method of noise reduction which exploits some features of the human auditory system is proposed by the authors. The noise suppression is obtained in two ways: by uplifting masking thresholds and by keeping noisy components just beneath these thresholds. The foundations of the engineered method are discussed extensively in the paper, and some engineered perceptual noise reduction algorithms are described. The way the noise reduction features are introduced into an MPEG encoder is demonstrated.

Recognition and Prediction of Music, a Machine Learning Approach

Szczerba Marek. 106th AES Convention, Munich, Germany, 1999.

This paper contains a description of a machine-learning-based system for the recognition and prediction of music. The presented system uses advanced data-mining algorithms: neural networks and rough sets. The system was applied for two main purposes: recognition of musical structures (phrase, rhythm and harmony) and prediction of musical elements (melody, rhythm and harmony). The system was optimized for each of these purposes. The problems related to the optimization process are presented. Conclusions concerning the application of machine learning methods to the music domain are derived and included.

Applications of Rough Sets and Neural Nets to Noisy Audio Enhancement

Królikowski Rafał and Czyżewski Andrzej. CD-ROM Proc. of the 7th European Congress on Intelligent Techniques and Soft Computing, Aachen, Germany, 13-16 September, 1999.

A new concept of reducing non-stationary noise affecting audio signals transmitted in telecommunication channels is proposed. This concept exploits some features of the human auditory system as well as some methods originating from the artificial intelligence domain, i.e. reasoning based on rough sets and neural processing. The foundations of the engineered method together with a description of the applied intelligent decision algorithms are presented in the paper. A number of experiments have been prepared, and some of them have already been carried out. Hence, a brief discussion of the results of these experiments and some conclusions are also included in the paper. The main focus is on a comparison between different intelligent methods used for non-stationary noise reduction.

Intelligent Echo and Noise Reduction

Czyżewski Andrzej, Królikowski Rafał, Zieliński Sławomir and Kostek Bożena. Proc. of the 3rd World Multiconference on Systemics, Cybernetics and Informatics (SCI'99) and the 5th International Conference on Information System Analysis and Synthesis (ISAS'99), Vol. 4, pp.234-238, Orlando, USA, 31 July -04 August, 1999.

New concepts of echo cancellation and reduction of non-stationary noise affecting audio signals transmitted in telecommunication channels are proposed. In both cases, some methods originating from the artificial intelligence domain, i.e. genetic algorithms, neural networks and rough sets, are applied. In turn, some features of the human auditory system used in the noise reduction method are presented in the paper. Furthermore, a number of experiments have been carried out, and a brief discussion of some of them is included in the paper.

Noise Reduction in Audio Signals Based on the Perceptual Coding Approach

Czyżewski Andrzej and Królikowski Rafał. Proc. of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 147-150, New Paltz, NY, USA, 17-20 October, 1999.

A new concept of reducing noise affecting audio signals transmitted in telecommunication channels is proposed. This concept exploits some features of the human auditory system. A strong subjective effect of noise suppression in noisy audio can be obtained by uplifting masking thresholds above the estimated level of the noisy components or by reducing this level in such a way that the components are maintained just below the masking thresholds. The foundations of the engineered method together with the appropriate algorithms are described in the paper. A brief discussion of the results of the experiments carried out and some conclusions are also included in the paper. The main focus is on the perceptual foundations of the noise reduction method.
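The second option named in the abstract, reducing the noise level until it sits just below the masking threshold, can be sketched per frequency band in Python as follows. This is only an illustration under stated assumptions: the band levels, thresholds, the 1 dB safety margin and the function name are hypothetical, and the paper's own algorithms (including how the masking thresholds are computed) are not reproduced.

```python
def attenuation_gains(noise_levels_db, masking_thresholds_db, margin_db=1.0):
    """Per-band linear gains that bring estimated noise `margin_db` below threshold.

    Bands whose noise is already far enough below the threshold are left
    untouched (gain 1.0).
    """
    gains = []
    for noise_db, thr_db in zip(noise_levels_db, masking_thresholds_db):
        excess_db = noise_db - (thr_db - margin_db)   # how far noise is above target
        gain_db = -max(excess_db, 0.0)                # attenuate only, never amplify
        gains.append(10.0 ** (gain_db / 20.0))        # dB to linear amplitude gain
    return gains


# Hypothetical estimated noise levels and masking thresholds in three bands (dB).
noise = [30.0, 20.0, 45.0]
thresholds = [40.0, 20.0, 25.0]
gains = attenuation_gains(noise, thresholds)
```

A real system would apply such gains to the spectral components of the noisy signal, so the useful signal in an attenuated band is affected as well; the sketch ignores this trade-off.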

Echo and Noise Reduction Methods for Multimedia Communication Systems

Czyżewski Andrzej, Królikowski Rafał, Zieliński Sławomir and Kostek Bożena. Proc. of the IEEE Signal Processing Society 1999 Workshop on Multimedia Signal Processing, pp.239-244, Copenhagen, Denmark, 13-15 September, 1999.

New concepts of echo cancellation and reduction of non-stationary noise affecting audio signals transmitted in telecommunication channels are proposed. In both cases, some methods originating from the artificial intelligence domain, i.e. genetic algorithms, neural networks and rough sets, are applied. Moreover, in the noise reduction method, some features of the human auditory system are exploited. A number of experiments have been carried out, and a brief discussion of some of them is included in the paper.

A Method for Echo Cancellation in Audio Signals Using the Genetic Algorithm

Czyżewski Andrzej and Zieliński Sławomir. CD-ROM Proc. of the Joint Meeting, 137th regular meeting of the Acoustical Society of America and the 2nd convention of the EAA: Forum Acusticum - integrating the 25th German Acoustics DAGA Conference, Berlin, Germany, 14-19 March, 1999.

In this paper, a new method of echo cancellation is proposed. This method is based on the use of models of systems causing the echo. Parameters of such models are optimized using the genetic algorithm. The computational cost of the proposed method can be minimized by the application of the correlation function.

New Method of Echo Cancellation

Zieliński Sławomir. Proc. of the 8th International Symposium on Sound Engineering and Mastering, pp. 31-34, Gdansk, Poland, 9-11 September, 1999.

A new method of echo cancellation based on the genetic algorithm is proposed. This method uses models of the systems causing the echo. Parameters of such models are optimized using the genetic algorithm.

Sound Synthesis Using Digital Waveguide Modeling

Zieliński Sławomir. Proc. of the 8th International Symposium on Sound Engineering and Mastering, pp. 213-216, Gdansk, Poland, 9-11 September, 1999.

A method of digital waveguide modeling was developed at Stanford University about ten years ago. Since that time, its popularity has grown steadily among researchers and companies producing electronic instruments. In this paper, the fundamentals of digital waveguide modeling will be reviewed. Moreover, sound examples obtained using this method will be presented.
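The core building block of this family of models, a delay line closed into a loop by a lossy reflection, can be sketched in Python as follows. It is a deliberately reduced illustration, not the model from the paper: the function name, the delay length and the reflection coefficient are assumptions, and practical waveguide models add filters and, for wind instruments, a nonlinear excitation.

```python
from collections import deque


def waveguide_loop(excitation, delay, reflection, n_samples):
    """Single delay-line loop: y[n] = x[n] + reflection * y[n - delay]."""
    # The deque holds the last `delay` output samples; index 0 is the oldest.
    line = deque([0.0] * delay, maxlen=delay)
    out = []
    for n in range(n_samples):
        x = excitation[n] if n < len(excitation) else 0.0
        returning = line[0]            # wave that has travelled the whole loop
        y = x + reflection * returning
        line.append(y)                 # pushing drops the oldest sample
        out.append(y)
    return out


# Impulse response of a hypothetical loop: 4-sample round trip, 10% loss per trip.
response = waveguide_loop([1.0], delay=4, reflection=0.9, n_samples=12)
```

The impulse returns every 4 samples with its amplitude scaled by 0.9 each time, which corresponds to a decaying tone whose period equals the loop delay.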

Computer Techniques in Electrostimulation Testing of Hearing and Hearing Aid Modeling

Skarżyński Henryk, Czyżewski Andrzej, Kostek Bożena and Szwoch Grzegorz. 3rd World Multiconference on Systemics, Cybernetics and Informatics (SCI'99) and the 5th International Conference on Information System Analysis and Synthesis (ISAS'99), Orlando, 1999.

In this paper two applications of computer techniques in audiology are presented. In the first part of the paper, a new method of examining the auditory tract by electrostimulation, developed in the Institute of Physiology and Pathology of Hearing, is described. The study is dedicated to the problem of evaluating the electrical sensitivity of the auditory nerve in deaf people with the assistance of computer technology. A new method is proposed which enables an assessment of both the hearing loss at a given moment and the future benefits of the cochlear implant to the patient. In the second part of the paper, a new method of fitting the acoustical elements of a hearing aid is proposed. A digital waveguide model of these elements is designed. Next, computer simulations are performed on the basis of this model. The desired shape of the transfer function of the model can be obtained by changing the values of its parameters. From the results of a computer simulation, the dimensions of the physical system can be calculated. This method can be used to design acoustical elements of a hearing aid having the desired acoustical properties. Both applications, although aimed at different group of patient.

Prediction of Post-Operative Profits in Cochlear Implanted Patients Using the Electrostimulation Procedure

Skarżyński Henryk, Czyżewski Andrzej and Kostek Bożena. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, USA, 1999

The presented research is devoted to the problem of evaluating the electrical sensitivity of the auditory nerve in deaf people. In the case of profound hearing impairments, assessing the degree of hearing loss with standard acoustic tests such as tonal or vocal audiometry, ABR testing, impedance audiometry, etc. often ends in a complete lack of response to an acoustic stimulus in the patient. That is why other diagnostic methods that enable evaluation of the auditory nerve electrical sensitivity have been designed and introduced into clinical practice. A method of transmitting the speech signal to the auditory nerve before cochlear implantation was conceived and tested. This method uses spectral transposition of the signal delivered to the external electrode, which makes it possible to stimulate the auditory nerve.

Modeling of Acoustics of Hearing Aid Earmold Systems

Szwoch Grzegorz, Kostek Bożena. 137th Acoust. Soc. of Amer. Meeting, Berlin 1999.

This paper addresses problems related to modeling the acoustical parts of hearing prostheses. In the case of the behind-the-ear (BTE) hearing aid, sound from the transducer is transferred to the auditory canal. The acoustic waveguide of the hearing aid serves this purpose. In more advanced hearing aids, this part is acoustically fitted according to the patient's needs. This is done experimentally during the fitting process. On the other hand, modeling of the waveguide can be based on the physical modeling of acoustical systems. The proposed approach and some preliminary results will be presented in the paper.

Designing Waveguide Elements of a Hearing Aid Using the Physical Modeling Techniques

Szwoch Grzegorz, Kostek Bożena, Czyżewski Andrzej. 106th AES Convention, Munich 1999.

The aim of this paper is to model a desired transfer function of a hearing aid. For this purpose physical modeling techniques were used in order to change parameters of a model in real time. The main features of a system allow one to design waveguide elements of a hearing aid. Such a system may be helpful in the process of fitting some hearing aid elements.

A Novel Approach to Echo Cancellation

Czyżewski Andrzej and Zieliński Sławomir. 106th Audio Engineering Society Convention, Munich, Germany, 1999

In this paper a new method of echo cancellation is proposed. This method is based on the genetic algorithm.

Simulating Acoustics of Hearing Aid Employing Non Linear Signal Filtering and Waveguide Modeling

Szwoch Grzegorz, Kostek Bożena, Czyżewski Andrzej. 108th AES Convention, Paris 2000.

A model of a hearing aid is designed and used to perform computer simulations that include signal processing (amplification, filtering and compression) as well as transmission of the sound to the ear through the acoustical waveguide. The method and some simulation results are presented; they are applicable to the process of fitting the hearing aid to the individual patient's needs.

Determining Influence of Visual Cues on the Perception of Surround Sound Using Soft Computing

Czyżewski Andrzej, Kostek Bożena, Odya Piotr, Zieliński Sławomir. RSCTC'2000, Banff, Canada, 2000.

Contemporary digital video, film or multimedia presentations are often accompanied by surround sound. Techniques and standards involved in digital video processing are much more developed than the concepts underlying the recording and mixing of multichannel sound. The main challenge in sound processing in a multichannel system is to create an appropriate basis for relating the multimodal contexts of the visual and sound domains. Therefore, one of the purposes of the experiments is to study how surround sound interferes with or is associated with the visual context. Studies of this kind have hitherto been carried out for two-channel sound associated with a stereo TV. However, few studies have yet associated surround sound with digital video presented on a TV screen. The main issue in such experiments is the analysis of the influence of visual cues on the perception of surround sound. This problem will be solved by applying fuzzy logic to the processing of subjective test results.

Influence of visual cues on the perception of surround sound

Czyżewski Andrzej, Kornacki Artur, Kostek Bożena, Odya Piotr, Zieliński Sławomir. 139th Meeting of the Acoustical Society of America, J. Acoust. Soc. Amer., Atlanta, USA, 2000

Contemporary digital video, film or multimedia presentations are often accompanied by surround sound. Techniques and standards involved in digital video processing are much more developed than the concepts underlying the recording and mixing of multichannel sound. The main challenge in sound processing in a multichannel system is to create an appropriate basis for connecting the multimodal contexts of the visual and sound domains. Therefore, one of the purposes of the experiments is to study how surround sound interferes with or is associated with the visual context. Studies of this kind have hitherto been carried out for two-channel sound associated with a stereo TV. However, few studies have yet associated surround sound with digital video presented on a TV screen. The main issue in such experiments is the analysis of the influence of visual cues on the perception of surround sound. This problem will be addressed in the paper.

Neuro-Rough Control of Masking Thresholds for Audio Signal Enhancement

Czyżewski Andrzej, Królikowski Rafał. Journal of Neurocomputing

The paper addresses the problem of neuro-rough hybridisation applied to non-stationary noise reduction. The goal of the intelligent controller is to estimate the current statistics of the corrupting noise by analysing signals taken from a telecommunication channel. The noise estimate is then used to determine the masking threshold levels that make the noise inaudible in the audio. Since the implemented decision algorithm requires quantised data, Kohonen's self-organising maps extended by various distance metrics were used as data quantisers. Some results of the experiments in the domain of non-stationary noise reduction in speech are discussed in the paper.

Expert System for Hearing Aids Fitting

Czyżewski Andrzej, Kostek Bożena, Suchomski Piotr. 108th AES Convention, Paris, France, 2000

The engineered experimental software makes it possible to find the characteristics of a hearing aid that match the patient's needs and to choose suitable hearing device characteristics automatically. The key issues related to the engineered application concern the expert system implementation. This expert system uses both fuzzy logic and rough set processing of analytical data. The principles of the engineered expert system application and some details of the rough set and fuzzy logic implementation will be presented in the paper.

Multimedia Technology Based Orientation System for Visually Impaired People

Czyżewski Andrzej, Kostek Bożena. 4th World Automation Congress, WAC 2000, Maui, Hawaii, USA, 2000

The research performed by the authors concerned a multimedia system intended to help people orient themselves in their surroundings and avoid obstacles and threats of any kind. The latter aim may be achieved by an intelligently controlled synthesis of the acoustic field based on digital image analysis. One of the features of such a system should be the ability to identify the location and describe the dimensions of obstacles in the environment. The idea of perceiving a "sound picture" instead of a visual one raises many issues that are closely related to the research subject discussed in the paper.

An Approach to the Automatic Classification of Musical Sounds

Czyżewski Andrzej, Kostek Bożena. 108th AES Convention, Paris, France, 2000

A study on the automatic classification of musical instrument sounds is presented. For this purpose a large database of musical instrument sounds was built, which consists of both solo and duet stereo recordings. The classification process of musical instrument sounds is done on the basis of some soft computing techniques, such as neural networks. The results of the classification are given as a percentage of musical instrument sounds properly recognized by the system. A discussion of the system efficiency and of its limitations is presented. Conclusions and remarks concerning further development of this study are included.

Automatic classification of musical instrument sounds

Kostek Bożena. 139th Meeting of the Acoustical Society of America, J. Acoust. Soc. Amer., Atlanta, GA, USA, 2000

The aim of the presented study is to show that automatic classification of musical instrument sounds is possible on the basis of a limited number of parameters. However, due to the complexity and the unrepeatable nature of musical sounds, both steady and transient states should be taken into account when creating feature vectors. For this purpose a database of musical instrument sounds was built, containing various instrument sounds played with different articulations. This database was then used in further experiments consisting of several stages, i.e. preprocessing, parameterization and pattern recognition. The main subject of this study was the optimization of the set of parameters to be included in the feature vectors.

Multimedia Hearing Aids Fitting System

Kostek Bożena, Czyżewski Andrzej, Skarżyński Henryk, Mazur J. 4th World Automation Congress, WAC 2000, Maui, Hawaii, USA, 2000

The application described in the paper is concerned with automatically finding the dynamic characteristics of a hearing aid that match the patient's needs. Multimedia computer technology makes it practical to organize hearing aid fitting on the basis of computer software. Consequently, the proposed method of testing hearing abilities and finding adequate dynamic processing characteristics of the hearing aid can be based entirely on multimedia computer technology. The subject of the application is a method of hearing aid fitting that employs tests of compressed speech understanding in noise, and the way of organizing such a hearing aid fitting procedure.

Shift in Localization of Phantom Sound Sources in Surround Sound versus Video Context

Kostek Bożena, Czyżewski Andrzej, Odya Piotr. 21st Tonmeistertagung, Hannover, Germany, 2000

Contemporary digital video, film or multimedia presentations are often accompanied by surround sound. The visual objects displayed on the screen can affect the perception of phantom sound sources in the surround panorama. Therefore, one of the purposes of the experiments is to study how surround sound interferes with or is associated with the visual context. The main issue in such experiments is the analysis of the influence of visual cues on the perception of surround sound. This problem will be solved by applying fuzzy logic to the processing of subjective test results.

Exploitation of Self-Organising Maps for the Reduction of Non-Stationary Noise in Speech Signals

Królikowski Rafał. ICSC (International Computer Science Conventions) Neural Computation, Berlin, 2000

The paper addresses the problem of reducing non-stationary noise, using an exemplary system architecture. Some innovations were introduced in the system: vector quantisation (VQ) of the time-varying noise statistics, application of Kohonen self-organising maps (SOM) as an intelligent controller for managing VQ, and exploitation of some masking properties of the human auditory system. Some results of the experiments in the domain of non-stationary noise reduction in speech are also discussed in the paper.

Localization of Sound Sources by Means of Recurrent Neural Networks

Królikowski Rafał, Czyżewski Andrzej, Kostek Bożena. The Second International Conference on Rough Sets and Current Trends in Computing, Banff, Canada, 2000

The issue of localization of sound sources for videoconferencing is discussed in the paper. A new algorithm for estimating speaker locations, based on recurrent neural networks (RNN), is introduced and described. The scheme of the experiments, which were carried out in an acoustically adapted chamber using the engineered method, is detailed.

Simulation of the Reverberant Space in the Multichannel Audio Using the Convolution Method

Czyżewski Andrzej, Kornacki Artur, Szwoch Grzegorz, Kostek Bożena. 17th International Congress on Acoustics, Rome, Italy, 2001

The convolution method is commonly used to simulate the reverberant space by convolving monophonic or stereophonic sounds with the impulse responses of the room. In this paper, application of this method to the multichannel audio is proposed. The impulse responses of the real room were recorded. Each of the audio channels was obtained using the convolution of the adequate room impulse response with monophonic source sound. The results of the convolution were then combined and encoded as the multichannel surround audio in the format 5.1. The time and spectral analyses of the resulting sounds, as well as the listening tests were performed. The results of these experiments are presented and discussed in the paper. The presented method allows one to simulate the acoustical conditions of the room where the monophonic audio was acquired. Possible applications of this method include advanced Internet teleconferencing in which the bandwidth requirements may be decreased by transmitting only monophonic sounds and the impulse responses of the room instead of the whole multichannel audio.
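
As a rough illustration of the convolution method described in this abstract, the sketch below convolves one monophonic source with a separate impulse response for each output channel. It is not the authors' implementation: the function names, the two-channel layout and the toy impulse responses are assumptions made for this example, and the 5.1 encoding step is omitted.

```python
def convolve(signal, impulse_response):
    """Direct-form linear convolution of two sequences of floats."""
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for n, x in enumerate(signal):
        for k, h in enumerate(impulse_response):
            out[n + k] += x * h
    return out


def render_channels(mono, room_irs):
    """Convolve one monophonic source with one impulse response per channel."""
    return {name: convolve(mono, ir) for name, ir in room_irs.items()}


# Hypothetical toy data: a 3-sample source and two very short "room" responses.
mono = [1.0, 0.5, 0.25]
room_irs = {
    "L": [1.0, 0.0, 0.5],   # direct sound plus one reflection
    "R": [0.0, 1.0, 0.25],  # delayed direct sound plus a weaker reflection
}
channels = render_channels(mono, room_irs)
```

Measured room responses are thousands of samples long, so a practical version would more likely use FFT-based convolution; the direct form is kept here only because it is easy to check by hand.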

Automatic Identification of Sound Source Direction Based on Neural Networks

Czyżewski Andrzej, Królikowski Rafał. 142nd Meeting of the Acoustical Society of America, J. Acoust. Soc. Amer., Fort Lauderdale, USA, 2001

In this paper a method for automatic detection of a sound source was studied. Both standard feed-forward and recurrent neural networks were employed in that method. A comparison of the results obtained is given. Conclusions are also derived.

Neural Networks Applied to Sound Source Localization

Czyżewski Andrzej, Królikowski Rafał, Kostek Bożena. 110th Audio Engineering Society Convention, Amsterdam, Netherlands, 2001

The primary aim of this paper is to show that it is possible to localise the direction of the incoming acoustical signal based on the neural network trained for that purpose. Consequently, the automatically localised acoustical signal may be attenuated if it obscures the desired target sound. A set of parameters was formulated in order to localise target source and unwanted signals. In order to process acoustical signals incoming from various directions at the same time the neural network-based system was designed and implemented. The feature extraction method is thoroughly discussed, the training process is described and recently obtained results are discussed.

Acquisition of Acoustic Signals Assisted by Recurrent Neural Networks

Czyżewski Andrzej, Królikowski Rafał. 17th International Congress on Acoustics, Rome, Italy, 2001

The issue of localisation of sound sources for videoconferencing is addressed in the paper, where a new method for estimating speaker locations is introduced. It is based on exploitation of temporal relationships between signals received by an array of microphones, and thereby recurrent neural networks are employed. Additionally, a parametrisation of the time-domain audio signals prior to the neural processing is performed. Some of the results of the experiments are briefly presented in the paper.

Determining Influence of Visual Cues on the Perception of Surround Sound Using Soft Computing

Czyżewski Andrzej, Kostek Bożena, Odya Piotr, Zieliński Sławomir. Series: Lecture Notes in Computer Science, vol. 2005, Springer-Verlag, 2001

The main challenge in sound processing in a multichannel system is to create an appropriate basis for relating the multimodal contexts of the visual and sound domains. Therefore, one of the purposes of the experiments is to study how surround sound interferes with or is associated with the visual context. Studies of this kind have hitherto been carried out for two-channel sound associated with a stereo TV.

Digital Waveguide Models of the Panpipes

Czyżewski Andrzej, Jaroszuk Jarosław, Kostek Bożena. ISMA'2001, Perugia, Italy, 2001

The principal aim of this paper is to present a digital waveguide model of the Panpipes. To model the Panpipes efficiently, the structure and physics of the instrument were studied and thoroughly discussed. The acquired knowledge was then used in the construction of the model. In this context, the principles of digital waveguide modeling of woodwind instruments are briefly reviewed. Because a digital waveguide is simple to design as a set of delay lines and scattering junctions, the model can be easily implemented on a digital signal processor. Two digital waveguide models of the Panpipes, differing in complexity, are presented in the paper. Their purpose was to examine how decreasing the complexity of the model influences the quality of the synthetic sound. The subjective tests showed that the simplifications introduced in the digital waveguide models have no noticeable influence on the sound quality. A comparison between synthetic and real Panpipes sounds was made. The results of both subjective tests and objective analyses obtained using the engineered models of the Panpipes are also included in the paper. Conclusions are derived.
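
To make the phrase "a set of delay lines" concrete, the sketch below simulates a single tube, closed at one end and open at the other, with two delay lines carrying the right-going and left-going pressure waves. It is only a minimal illustration and not either of the Panpipes models from the paper: the function name, the reflection coefficients (+1 at the closed end, a constant inverting loss at the open end) and the absence of scattering junctions and reflection filters are all simplifying assumptions.

```python
from collections import deque


def closed_open_pipe(one_way_delay, num_samples, loss=0.95):
    """Impulse response of a lossy two-delay-line waveguide.

    Pressure waves are assumed to reflect with +1 at the closed end and
    with -loss at the open end (a crude stand-in for a reflection filter).
    """
    right = deque([0.0] * one_way_delay)  # wave travelling towards the open end
    left = deque([0.0] * one_way_delay)   # wave travelling back to the closed end
    right[0] = 1.0                        # unit impulse injected at the closed end
    output = []
    for _ in range(num_samples):
        at_open_end = right.pop()         # sample arriving at the open end
        at_closed_end = left.pop()        # sample arriving at the closed end
        output.append(at_open_end)
        left.appendleft(-loss * at_open_end)  # inverting, lossy reflection
        right.appendleft(at_closed_end)       # non-inverting reflection
    return output


# Hypothetical settings: 4 samples of one-way delay, 24 output samples.
response = closed_open_pipe(one_way_delay=4, num_samples=24)
```

With a one-way delay of N samples the pulse returns to the open end every 2N samples with its sign inverted, so the response repeats every 4N samples, which is the quarter-wavelength behaviour expected of a tube closed at one end.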

Waveguide Modeling of Ancient, Japanese Musical Instruments

Czyżewski Andrzej, Kostek Bożena, Zieliński Sławomir. ISMA'2001, Perugia, Italy, 2001

Problems related to the implementation of physical modeling-based synthesis of two traditional Japanese instruments are discussed. Examples of computer analyses of sounds of the shakuhachi and the koto are presented. On the basis of these analyses some assumptions concerning waveguide models were made. Physical modeling principles of musical instrument sound generation were also briefly reviewed. The main differences in modeling wind and string instruments were highlighted. The process of constructing models of these two musical instruments was explained. A short discussion of the problems that occurred while creating such models was given. Some general conclusions concerning real-time implementation of the digital waveguide models were also included.

Dereverberation Based on the Genetic Algorithm

Czyżewski Andrzej, Zieliński Sławomir. 17th International Congress on Acoustics, Rome, Italy, 2001

In this paper, a new method of echo cancellation is proposed that is applicable to some telecommunication systems. This method is based on the application of the inverse model of the system causing the echo. Parameters of such a model are optimized using the genetic algorithm. Some exemplary results of echo cancellation obtained with the use of the proposed method are discussed.
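
The idea of optimizing the parameters of a reverse (inverse) model of the echo-producing system with a genetic algorithm can be sketched as follows. Everything specific here is an assumption made for illustration and is not taken from the paper: the single-echo model, the availability of a clean reference signal for the fitness function, the function names, and the selection, crossover and mutation settings.

```python
import random


def add_echo(x, delay, gain):
    """Echo model assumed here: y[n] = x[n] + gain * x[n - delay]."""
    return [s + (gain * x[n - delay] if n >= delay else 0.0)
            for n, s in enumerate(x)]


def remove_echo(y, delay, gain):
    """Inverse (recursive) model: x_hat[n] = y[n] - gain * x_hat[n - delay]."""
    out = []
    for n, s in enumerate(y):
        out.append(s - (gain * out[n - delay] if n >= delay else 0.0))
    return out


def evolve(y, reference, generations=30, pop_size=20, max_delay=16, seed=0):
    """Search for (delay, gain) of the inverse model with a small elitist GA."""
    rng = random.Random(seed)

    def error(individual):
        estimate = remove_echo(y, individual[0], individual[1])
        return sum((a - b) ** 2 for a, b in zip(estimate, reference))

    pop = [(rng.randint(1, max_delay), rng.random()) for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        pop.sort(key=error)
        history.append(error(pop[0]))
        parents = pop[: pop_size // 2]      # truncation selection keeps the best
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            delay, gain = a[0], b[1]        # crossover: one gene from each parent
            if rng.random() < 0.3:          # occasional mutation of the delay
                delay = min(max_delay, max(1, delay + rng.choice((-1, 1))))
            gain = min(1.0, max(0.0, gain + rng.gauss(0.0, 0.05)))
            children.append((delay, gain))
        pop = parents + children
    return min(pop, key=error), history


# Hypothetical test signal: white noise with one echo (delay 5, gain 0.6).
rng = random.Random(1)
clean = [rng.uniform(-1.0, 1.0) for _ in range(200)]
echoed = add_echo(clean, delay=5, gain=0.6)
best, history = evolve(echoed, clean)
```

Because the best half of each generation is carried over unchanged, the best error recorded in `history` can never increase from one generation to the next; whether the search reaches the true delay and gain depends on the random seed and settings.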

Encoding Spatial Information for Advanced Teleconferencing

Czyżewski Andrzej, Królikowski Rafał, Kostek Bożena. 19th International AES Conference, Schloss Elmau, Germany, 2001

The aim of this paper is to show a system that enables automatic identification of a sound source position in noisy acoustical conditions with a considerable accuracy. Automatic detection of sound source in such an acoustical environment is much needed in advanced teleconferencing. The approach shown in the paper is based on Artificial Neural Networks (ANNs) used for automatic sound localisation. Both standard feed-forward ANNs and Recurrent Neural Networks (RNNs) are employed for that purpose. Comparison of the results obtained, based on both types of ANNs, is also given. Conclusions are derived and shown.

The Internet Sound Restoration Service Based on the Perceptual Denoising Method

Czyżewski Andrzej. 20th Audio Eng. Soc. International Conference, Budapest, Hungary, 2001

An Internet service intended for on-demand restoration and publishing of audio content related to the world's cultural heritage was launched. A special way of acquiring, processing and publishing archive recordings was conceived in order to ensure proper dissemination of the proposed service and its long-term maintenance. The sound enhancement method underlying the system operation employs extended perceptual coding of audio material, allowing for simultaneous noise reduction and sound compression. Moreover, a non-linear predictor employing neural networks was applied to the detection and removal of impulse distortions. The system is still in the development phase; thus both the system features already implemented and the technical assumptions related to its further development are presented in the paper.

Discovering the Influence of Visual Stimuli on The Perception of Surround Sound Using Genetic Algorithms

Czyżewski Andrzej, Kostek Bożena, Odya Piotr, Smolinski T. 19th International AES Conference, Schloss Elmau, Germany, 2001

The paper contains a description of experiments that aim to determine the influence of visual cues on the perception of spatial sound. An earlier stage of the experiments showed that there is a relationship between the perception of video presented on the screen and sound signals reproduced in a surround system. However, this relationship depends on the type of audio-visual signals. Thus a series of subjective tests has been performed with dozens of experts in order to discover these dependencies. The main issue in such experiments is the analysis of the influence of visual cues on the perception of surround sound. This problem is solved by applying genetic algorithms to the processing of subjective test results. Conclusions concerning the complexity of the investigated problem are included.

Applications of Neural Networks and Perceptual Masking to Audio Restoration

Czyżewski Andrzej. Journ. of New Music Research, vol. 30, No. 4

Applications of learning algorithms to the restoration of recordings are presented. Attention is paid to the usage of artificial neural networks as a decision system determining which components of an input signal are valid and which ones are unwanted. It provides a basis for the parasitic impulse detection and for the interpolation of lost signal intervals. Such an approach enables also an efficient noise reduction employing the extended perceptual coding algorithm. The proposed algorithms are described briefly in the paper, obtained results are discussed and some general conclusions concerning the application of soft computing and perceptual masking to sound restoration are added.

Problems Related to Surround Sound Production

Kornacki Artur, Kostek Bożena, Odya Piotr, Czyżewski Andrzej. 110th AES Convention, Amsterdam, Netherlands, 2001

The production of recordings intended for surround sound systems is becoming a vital problem in sound technology. Existing standards of surround systems allow for reproduction of spatial sound. However, there are no consistent recommendations as to which microphone and mixing techniques should be used in specific situations. For the purpose of the research presented in this paper, several microphone techniques were used to record a quartet playing classical music. The mixing resulted in two-channel excerpts and several multichannel ones intended for a 5.1 reproduction system. Then, in order to find the most preferable recording technique, these excerpts were used in subjective tests.

Multimedia Techniques Applied to Health Care Procedures - Hearing Aid Fitting Expert System

Kostek Bożena, Czyżewski Andrzej. 46 Internationales Wissenschaftliches Kolloquium, Ilmenau, Germany, 2001

In this paper an exemplary implementation of a complex multimedia system in the domain of health care and its integration into the user environment is shown. The engineered Multimedia Hearing Aid Fitting Expert System is an experimental software program that automatically finds the characteristics of a hearing aid matching the patient's needs. The fitting of hearing aids is based either on classical methods that use audiometric test results or on loudness scaling principles. All these methods are based on artificial test signals. However, the fitting of hearing aids should be performed on the basis of testing speech understanding in noise. Satisfactory reliability of these tests may be achieved through the use of modern, properly calibrated computer technology. The principles of the engineered software application, some details of the calibration process, and results of the experiments will be presented in the paper.

Expert system for Musical Style Recognition

Kostek Bożena. International Workshop: Human Supervision and Control in Engineering and Music, Stadthalle Kassel, Germany, 2001

In this overview some concepts concerning sound engineering, computer music and human supervision are presented. Multimodal human-computer interaction consists in, among other things, collecting and intelligently searching music-related information. Some concepts related to the author's experience will be presented. Key findings in sound engineering allow music to be recorded in a natural way. Computers can be employed both as Internet sites collecting music-related data and as algorithmic tools that enable musicians to find the information they need. They make it possible to analyze a given melody, modify it in musically sensible ways, mimic the human way of composing, etc. Human supervision is needed at both stages. The quality of a recording cannot be assessed otherwise than subjectively. Organizing a computer site containing music-related information also requires supervision by the future user. Developing artificial intelligence algorithms and designing ergonomic user interfaces is also a task for a human supervisor.

Wavelet-based automatic recognition of musical instruments

Kostek Bożena, Żwan Paweł

The objective of the present work is to automatically extract information from monophonic sounds. This process consists of several stages, namely preprocessing, parameterization, and classification. This paper presents a thorough study of the wavelet-based parameterization of musical instrument sounds and their automatic recognition by means of artificial neural networks (ANNs). First, an engineered method of pitch detection is presented and exemplified by several analyses. A short discussion of the error associated with automatic pitch tracking is also included. Then, examples of time-frequency analyses of various musical instrument groups are presented. The analyses are performed employing a database containing musical sounds recorded at the Sound and Vision Engineering Department, Technical University of Gdansk. On the basis of such analyses a set of parameters is derived. Feature vector properties are then discussed. For that purpose the Fisher statistic is used. It makes it possible to check the separability between pairs of musical instruments. In addition, artificial neural networks are used for the automatic recognition of musical instrument groups. Various structures and training methods of the ANNs are examined. Exemplary results of the investigations are provided and analyzed. Concluding remarks concerning further development of such experiments are also included in the paper.
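
The separability check mentioned in this abstract can be illustrated with a short sketch. The abstract does not give the formula, so the form used below, the absolute difference of class means divided by the square root of the summed variances of the means, is an assumption, as are the function name and the invented parameter values for two instruments.

```python
from math import sqrt
from statistics import mean, variance


def fisher_statistic(x, y):
    """Separability of one parameter between two instrument classes.

    Assumed form: |mean(x) - mean(y)| / sqrt(var(x)/n + var(y)/m),
    with unbiased sample variances.
    """
    n, m = len(x), len(y)
    return abs(mean(x) - mean(y)) / sqrt(variance(x) / n + variance(y) / m)


# Hypothetical values of a single parameter for two instruments.
instrument_a = [1.0, 2.0, 3.0]
instrument_b = [4.0, 5.0, 6.0]
instrument_c = [1.5, 2.5, 3.5]

v_ab = fisher_statistic(instrument_a, instrument_b)  # well separated pair
v_ac = fisher_statistic(instrument_a, instrument_c)  # overlapping pair
```

A larger value indicates that the parameter separates the two classes better, so in this toy data the pair (a, b) scores higher than the overlapping pair (a, c).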

Management of Musical Data

Kostek Bożena. International Workshop: Human Supervision and Control in Engineering and Music, Stadthalle Kassel, Germany, 2001

In this overview some concepts concerning future perspectives of transdisciplinary research will be presented. There are many problems related to the management of musical data that have not been solved so far. These problems are now being studied extensively within the Music Information Retrieval (MIR) field. Topics that should be addressed within the scope of this discussion include, but are not limited to, the following: the problem of automatically classifying musical instrument sounds and musical phrases/styles, music representation and indexing, estimating similarity of music using both perceptual and musical criteria, problems of recognizing music using audio or semantic description, building up musical databases, evaluation of MIR systems, intellectual property right issues, user interfaces, issues related to musical styles and genres, language modeling for music, user needs and expectations, auditory scene analysis, gesture control over musical work, etc. Some of these topics are covered by the MPEG-7 standardization process, which describes the multimedia content data that will support some degree of interpretation of the information meaning, which can be passed onto, or accessed by, a device or a computer code (MPEG-7).

Audio Material Extraction from the Internet Databases

Kostek Bożena. 46 Internationales Wissenschaftliches Kolloquium, Tagungsband, 2001

The paper will outline the problems related to automatic search for audio material. The aim of this paper is to show how to automatically recognize individual musical instrument sounds contained in Internet sites or multimedia databases. This feature is highly needed in today's Internet browsers. In order to recognize musical instruments properly several stages are needed, namely preprocessing, parameterization, and the actual recognition/classification process. The classification of musical instrument sounds can be done by means of soft computing techniques that use a learn-and-test approach. The main principles of the methods for automatic recognition/classification of musical instrument sounds developed and tested at the Sound & Vision Engineering Department, Technical University of Gdansk will be described. Key challenges in multimedia technology related to this problem will also be presented.

In Search for Surround Sound Recording Techniques

Kostek Bożena, Czyżewski Andrzej. ISMA'2001, Perugia, Italy, 2001

The existing and recently introduced standards of surround systems allow for reproduction of spatial sound in almost any room conditions. The vital concerns of sound production for surround systems are the number of microphones, their positioning, the proportion between direct sound, early reflections and reverberation, artificially added delays, etc. A proper solution of such problems may result in creating a spatial impression that is comparable to the perception of live music. However, a study of this kind should address some of the questions related to surround sound production. The broader aim is to establish recommendations on how to produce recordings of classical music intended for surround sound systems in specific acoustical conditions and then to reproduce them properly. This paper presents a study in which several microphone techniques were used to record classical music in two concert halls having different acoustical properties. Based on these recordings and various mixing techniques, two-channel stereo excerpts and some multichannel ones were produced. The latter were encoded in the 5.1 multichannel format. Extensive subjective tests were performed with a group of sound engineers and students in order to find the most preferable recording techniques. The listening tests were first performed with excerpts obtained for each room separately, and then the best productions from the two rooms were compared. The subjective tests were carried out in the same listening room equipped with a 5.1 surround reproduction system. In the paper the results of these comparison tests are shown. The methodology of carrying out the subjective tests is presented. A discussion of the obtained results and some conclusions are also included.

Representing Musical Instrument Sounds for Their Automatic Classification

Kostek Bożena, Czyżewski Andrzej. J. Audio Eng. Soc., vol. 49, No. 9

A study of the automatic classification of musical instrument sounds is presented. For this purpose a database of musical instrument sound parameters was built which consists of musical instrument recordings and their parametric representations. The parameterization process was conceived and performed in order to find significant musical instrument sound features and to remove redundancy from the musical signal. Classification experiments of musical instrument sounds were performed with neural networks allowing a discussion of the efficiency of the feature extraction process and its limitations. Conclusions and remarks concerning further development of this study and its relation to the current MPEG-7 standardization process are included.

Automatic Recognition of Musical Instrument Sounds - Further Developments

Kostek Bożena, Czyżewski Andrzej. 110th Audio Eng. Soc. Convention, 2001

The discussion on retrieving musical data from the Internet or multimedia databases, which has been carried on for some time, has not yet reached the stage of practical application. There are still many problems related to the automatic recognition of music or musical instrument sounds that cannot be easily solved. It is especially important to find adequate parameters of the musical signal based on time and frequency and/or wavelet analyses. The proposed feature vectors were derived on the basis of the constructed databases that contain recorded musical sounds. The presented study shows methods of automatic identification of musical instruments based on both classical statistical and soft computing approaches, which were then used to classify musical instruments. A set of results obtained in the investigations is provided and analyzed, and concluding remarks are included in the paper.

Internet-Based Automatic Hearing Assessment System

Kostek Bożena, Czyżewski Andrzej, Skarżyński Henryk, Kochanek K. 46 Internationales Wissenschaftliches Kolloquium Ilmenau, Ilmenau, Germany, 2001

The aim of this paper is to show an application of new media to the domain of health care. The paper describes an Internet-based system that allows for automatic testing of hearing. Hearing impairment is one of the fastest growing diseases of modern society. Therefore it is very important to organize mass screening tests to identify people suffering from this kind of impairment. The described application provides a test that uses automatic questionnaire analysis, audiometric tone test procedures, and an assessment of speech intelligibility in noise. When all the testing is completed, the system automatically analyzes the results for each person examined. Based on the number of incorrect answers, the expert system automatically decides whether the person has normal hearing or has hearing problems and needs to be examined in one of the consulting centers. Those whose hearing impairment is confirmed are referred for treatment in rehabilitation centers. All these centers are connected via the Internet and are provided with special distributed database access allowing them to automatically register and track the patients identified during the remote screening.

A method for the automatic hearing aid fitting employing speech in noise

Kostek Bożena, Czyżewski Andrzej. 142nd Acoustical Soc. of America Meeting, Fort Lauderdale, USA, 2001

Some limitations of the hearing aid fitting process are discussed. The classical procedures in this process are based on audiometric test results and/or the loudness scaling method employing artificial test signals. However, the fitting of hearing aids should also be performed on the basis of testing speech understanding in noise, because this is much closer to real-life conditions. Satisfactory reliability of these tests may be achieved through the use of modern computer technology together with a properly calibrated sound system. A new strategy applicable to fitting prostheses was developed. It allows the characteristics of a hearing aid matching the patient's needs to be found automatically. The principles of the fitting method employing fuzzy reasoning, and some results of the experiments will be presented in the paper.

Automatic Recognition of Musical Instrument Sounds

Kostek Bożena. ICA'2001, Rome, Italy, 2001

The presented study examines the possibility of automatic identification of musical instruments based on signal processing and some intelligent decision techniques. The study is directed at automatic retrieval of musical sounds from Internet databases. Several stages should be performed before the actual recognition process takes place. It is especially important to find adequate descriptors of musical sounds. Appropriate sound parameters are to be fed to the inputs of decision algorithms. They should be well related to sound characteristics, both objectively measured and subjectively perceived. The proposed feature vectors are derived on the basis of a thorough examination of sound analysis results. Parameters are looked for in the frequency and time-frequency domains. A discussion concerning the choice of parameters that might be contained in the feature vectors is also included. An expert system based on some classification methods, both classical and soft computing ones, is used for automatic classification purposes. Exemplary results obtained in experiments and derived conclusions are included in the paper.

Employing Fuzzy Logic and Noisy Speech for Automatic Fitting of Hearing Aids

Kostek Bożena, Czyżewski Andrzej. 142nd Acoustical Soc. of America Meeting, Fort Lauderdale, USA, 2001

In this paper some limitations of the hearing-aid fitting process are discussed. In the fitting process, an audiologist performs tests on the wearer of the hearing aid, which is then adjusted based on the results of the test, with the goal of making the device work as well as it can for that individual. Traditional fitting procedures employ specialized testing devices which use artificial test signals. Ideally, however, the fitting of hearing aids should also simulate real-world conditions, such as listening to speech in the presence of background noise. Therefore, more satisfying and reliable fitting tests may be achieved through the use of multimedia computers equipped with a properly calibrated sound system. We have developed a new automatic system for fitting hearing aids. It employs fuzzy logic. In this process, a computer makes choices for adjusting the hearing aid's settings by analyzing the patient's responses and answering questions with replies that can lie somewhere between a simple "yes" and "no." This paper will describe the method and present some results of the experiments conducted to test the system.

Localization of Sound Sources by Means of Recurrent Neural Networks

Królikowski Rafał, Czyżewski Andrzej, Kostek Bożena. Series: Lecture Notes in Computer Science, vol. 2005, Springer-Verlag, 2001

The issue of localization of sound sources for videoconferencing is discussed in the paper. A new algorithm for estimating speaker locations, based on recurrent neural networks (RNN), is introduced and described. The scheme of experiments carried out in an acoustically adapted chamber using the engineered method is detailed.

Prediction of the Reverberation Time in Rectangular Rooms with Non-Uniformly Distributed Sound Absorption

Neubauer R., Kostek Bożena. Archives of Acoustics, vol. 26, No. 3

The aim of this paper is first to review the best known reverberation time formulae and then to show that they cannot predict the reverberation time accurately in the cases most often encountered in practice, where the sound field is not diffuse. Introducing a correction to Fitzroy's formula allows a better prediction of the reverberation time in the case of non-uniformly distributed sound absorption. A comparison of calculation results obtained on the basis of both the classical equations and the newly introduced reverberation time formula is shown and conclusions are drawn.
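
As an illustration only, the sketch below computes the reverberation time of a rectangular room with three classical formulae in their commonly cited textbook forms: Sabine, Eyring and Fitzroy. It is not taken from the paper and does not include the correction to Fitzroy's formula that the paper introduces. The function names, the constant 0.161 s/m (metric units) and the room data in the usage note are assumptions made for the example.

```python
import math

# Assumed metric constant (s/m) used in the textbook forms of the formulae.
SABINE_CONSTANT = 0.161


def pair_areas(length, width, height):
    """Areas (m^2) of the three pairs of opposite surfaces of a rectangular room."""
    x = 2.0 * width * height    # end walls
    y = 2.0 * length * height   # side walls
    z = 2.0 * length * width    # floor and ceiling
    return x, y, z


def sabine_rt(volume, areas, alphas):
    """Sabine: T = 0.161 V / sum(S_i * alpha_i)."""
    absorption = sum(s * a for s, a in zip(areas, alphas))
    return SABINE_CONSTANT * volume / absorption


def eyring_rt(volume, areas, alphas):
    """Eyring: T = 0.161 V / (-S ln(1 - mean alpha))."""
    total = sum(areas)
    mean_alpha = sum(s * a for s, a in zip(areas, alphas)) / total
    return SABINE_CONSTANT * volume / (-total * math.log(1.0 - mean_alpha))


def fitzroy_rt(length, width, height, alpha_x, alpha_y, alpha_z):
    """Fitzroy (uncorrected): area-weighted sum of Eyring-type terms,
    one per pair of opposite surfaces with its own mean absorption."""
    volume = length * width * height
    areas = pair_areas(length, width, height)
    total = sum(areas)
    rt = 0.0
    for area, alpha in zip(areas, (alpha_x, alpha_y, alpha_z)):
        rt += (area / total) * SABINE_CONSTANT * volume / (-total * math.log(1.0 - alpha))
    return rt
```

For a hypothetical 10 m x 6 m x 3 m room with reflective walls (absorption coefficient 0.05) and an absorbent floor and ceiling (0.6), `fitzroy_rt(10, 6, 3, 0.05, 0.05, 0.6)` gives a noticeably longer reverberation time than `eyring_rt` for the same surfaces, which reflects the sensitivity to non-uniformly distributed absorption discussed in the abstract. With identical coefficients on all surfaces the two formulae coincide.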

Determining the influence of visual stimuli on the perception of surround sound using data mining algorithms

Odya Piotr, Czyżewski Andrzej, Kostek Bożena, Smolinski T. 142nd Meeting of the Acoustical Society of America, J. Acoust. Soc. Amer., Fort Lauderdale, USA, 2001

A short description of experiments that aim to determine the influence of visual cues on the perception of spatial sound is provided in the paper. An earlier stage of the experiments showed that there is a relationship between the perception of video presented on the screen and sound signals reproduced in a surround system. However, this relationship depends on the type of audio-visual signals. Thus a series of subjective tests was performed on dozens of experts in order to discover these dependencies. The main issue in such experiments is the analysis of the influence of visual cues on the perception of surround sound. This problem is addressed by applying a genetic algorithm and a rule searching mechanism to the processing of subjective test results. Some results and conclusions concerning the complexity of the investigated problem are included.

Determination of Influence of Visual Cues on Perception of Spatial Sound

Odya Piotr, Czyżewski Andrzej, Kostek Bożena. 110th Audio Eng. Soc. Conv., Amsterdam, Netherlands, 2001

The paper contains a description of experiments that aim to determine the influence of visual cues on the perception of spatial sound. An earlier stage of the experiments showed that there is a relationship between the perception of video presented on the screen and sound signals reproduced in a surround system. However, this relationship depends on the type of audio-visual signals. Thus a series of subjective tests was performed on dozens of experts in order to discover these dependencies. The main issue in such experiments is the analysis of the influence of visual cues on the perception of surround sound. Conclusions concerning the complexity of the investigated problem are included.

Computer simulations of hearing aid acoustical system performance

Kostek Bożena, Szwoch Grzegorz. 142nd Meeting of the Acoustical Society of America, J. Acoust. Soc. Amer., Fort Lauderdale, USA, 2001

Recent developments in hearing aid technology have enabled a number of improvements in hearing aids. These include advanced signal processing algorithms, better speech intelligibility, miniaturization, etc. One of the existing limitations, however, is the difficulty of providing patient-related characteristics of the acoustical system of a hearing aid. The aim of this paper is to show that using the physical modeling method it is possible first to build a model of the acoustical system of a hearing aid and then to simulate its performance. A waveguide model of the acoustical system of a hearing aid is proposed. Exemplary results of computer simulations using such a model are presented and compared with some measurement data of existing hearing aid acoustical systems. The model proved to behave similarly to the real system. Conclusions regarding the application of such a method in the fitting process of a hearing aid are included.

Computer Modeling of Acoustical Elements of a Hearing Aid

Szwoch Grzegorz, Kostek Bożena, Czyżewski Andrzej. Archives of Acoustics, vol. 26, No. 3

In this paper, application of computer modeling methods to the process of hearing aid fitting is described. A computer model of the acoustical system of a hearing aid is presented. Exemplary results of the experiments are presented and compared with measurement data. The model proved to behave similarly to the physical system. Further improvements to the model are discussed.

Neural Computation of Direction-Of-Arrival of Sound

Czerniawski Jacek, Czyżewski Andrzej, Królikowski Rafał. 3rd WSEAS (World Scientific and Engineering Academy and Society) Int. Conf. on Neural Network and Applications (NNA '02), Interlaken, Switzerland, 2002

 

Some Rules and Methods for Creation of Surround Sound

Czyżewski Andrzej, Kornacki Artur, Odya Piotr. 21st AES Conference, Petersburg, Russia, 2002

The problem of selecting adequate surround sound live recording and reproduction methods is still open. Alternative methods of organizing this process are discussed in the paper. Some experimental recording sessions employing the 5.1 format were made with the use of various miking techniques and a convolution-based multichannel audio processing algorithm. The results were submitted to subjective assessment and then compared. Conclusions resulting from the performed experiments are derived and discussed.

Making Surround Audio Considering Image Proximity Effect

Czyżewski Andrzej, Kostek Bożena, Odya Piotr. 112th AES Convention, Munich, Germany, 2002

The problem of the influence of video content on surround sound perception was addressed by means of subjective testing procedures in which experts listened to the sound with and without the video image present and provided their answers. The results of the experiments demonstrated in which cases and how video may affect the localization of virtual sound sources. The obtained data were then analyzed by means of modern techniques of intelligent data exploration and knowledge discovery, which made it possible to find some hidden relations between semantic descriptors of subjective impressions. Finally, based on the results of the data analysis, a set of rules concerning the mastering of multichannel audio to accompany various types of video content was derived. Some results of this study will be presented and discussed in the paper.

Rough-Neuro Approach to Testing Influence of Visual Cues on Surround Sound Perception

Kostek Bożena. In: S. K. Pal, L. Polkowski, A. Skowron (eds.), Rough-Neuro Computing: A Way to Computing with Words, Springer Verlag, Series on Artificial Intelligence, 2002

 

Estimation of Non-Stationary Noise for Audio Enhancement by Means of Recurrent Neural Networks

Królikowski Rafał. 3rd WSEAS (World Scientific and Engineering Academy and Society) Int. Conf. on Neural Network and Applications (NNA '02), Interlaken, Switzerland, 2002

 

Soft Computing in Acoustics, Applications of Neural Networks, Fuzzy Logic and Rough Sets to Musical Acoustics, Studies in Fuzziness and Soft Computing, vol. 31

Kostek Bożena. Physica Verlag, Heidelberg, New York 1999. (ISBN 3-7908-1190-4)

The book presents applications of some selected soft computing methods to acoustics and sound engineering. The aim of this research study is the application of soft computing methods to musical signal analysis and to the recognition of musical sounds and phrases. Accordingly, some methods based on such learning algorithms as neural networks, rough sets and fuzzy logic were conceived, implemented and tested. Additionally, the above-mentioned methods were applied to the analysis and verification of subjective testing results. The last problem discussed within the framework of this book is the fuzzy control of the classical pipe organ. The obtained results show that computational intelligence and soft computing may be used for solving some vital problems in both musical and architectural acoustics.
Contents:
a. Foreword
b. Preface
c. Introduction
d. Some Selected Soft Computing Tools and Techniques: Artificial Neural Networks; Fuzzy Sets and Fuzzy Logic; Rough Sets
e. Preprocessing of Data in Acoustics: Musical Signal Representation; Musical Phrase Analysis; Acquisition of Test Results; Data Discretization
f. Automatic Classification of Musical Instrument Sounds: Uncertainty of Musical Instrument Sound Representation; Feature Vector Extraction; Statistical Properties of Musical Data; Neural Network as a Classifier of Musical Instruments; Rough Set Decision System as a Classifier of Musical Instruments
g. Automatic Recognition of Musical Phrases: Data Acquisition; Parametrization Process; Neural Network as a Classifier of Musical Phrases; Rough Set-Based Classification of Musical Phrases
h. Intelligent Processing of Test Results: Inconsistency of Subjective Assessment Results; Application of Fuzzy Logic to the Processing of Test Results; Application of Rough Sets to the Processing of Test Results; Rough-Fuzzy Method of Test Result Processing
i. Control Applications: Articulation-Related Features in the Pipe Organ Sound; Fuzzy Control of Pipe Organ
j. Conclusions