CS425FZ Java

Java Python CS425FZ (Audio & Speech Processing)
Assignment 1
(value 20%)
Released date: Tuesday 26
th
November 2024
Due date: Sunday 15
th
December 2024 at 23:59
This is an open-book, graded assignment. Please cite all references as comments in your
submissions. You cannot directly reuse a solution from online sources or AI. You must not engage
with another student, in person or electronically (via phone, social media, etc.), to secure
assistance with this Assignment. If you do so (even for only one of the questions), you will receive
an automatic failure (0%), and it will also be reported to the Executive Vice-Dean of MIEC and/or
Maynooth University Plagiarism board. We will perform similarity checks on submitted
assignments to check for collaborative efforts. The lecturer reserves the right to interview you
about your submission in special cases. It should be mentioned that the Turnitin tool provided in
Moodle can detect AI-generated context.
The first assignment is to use the programs in Java, processing, Python, or Octave/MATLAB to
prepare a narrative on digital waveforms and spectral analysis using the FFT and the Spectrogram
to demonstrate your knowledge of how they work. Make sure that each plot can clearly illustrate
the shape of the waveform, i.e. if you have to zoom in to get this, do so. A thick coloured block is
not acceptable. The results from your plotting should be placed into a PowerPoint presentation,
and along with the plot, a sound file should be inserted into the page (it should be imported as
mp3 to save space). The documents should also show on the following slide to each plot the
programming scripts to generate the wave and its graph. The graphs should have titles, labelled
axes and a caption in the document (e.g. Figure 1, Figure 2).
Waveforms
1. Generate and plot one example of the waveform of a sinusoid at a frequency, amplitude,
and phase of your choice. Show the waveform from time t=0. Select the frequency of the
sinewave from the set of musical notes
https://homes.luddy.indiana.edu/donbyrd/Teach/MusicalPitchesTable.htm
Make sure to give the frequency of the wave in the title of the plot.
2. Generate and plot an example of waveforms composed of sinusoids at harmonically
related frequencies to create either a sawtooth wave, a square wave or a triangle wave.
3. Read in a wav file of an “effect”/natural sound and plot only 20 seconds of it.
4. Read in a wav file of a Speech utterance (it could be from the web or recorded by yourself)
and plot it (approx. 2-5 seconds), put the text of the utterance in the title of the plot. Page 2 of 2

Fourier transform
5. Plot the magnitude of the Fourier transform (FFT) of a signal composed of more than one
sinusoid of different frequencies and amplitudes using a rectangular window. Use an FFT
length of N=256 and then N=2048.
6. Plot the magnitude of the Fourier transform (FFT) of the same signal composed of more
than one sinusoid of different frequencies and amplitudes using a Hanning window. Use an
FFT length of N=256 and then N=2048.
7. Record at least 1 second of you saying any vowel sound using Audacity or an equivalent
software. Use the editor to retain only the steady portion of the vowel waveform. Plot the
magnitude of the Fourier transform of this, picking a suitable value for N (e.g. 256, 512,
1024 2048) so that it is easy to identify at least two formant peaks form the spectrum.
Spectrogram
8. Plot the spectrogram of the speech waveform you used earlier for a short window N=256
and a long window N=1024. Identify the voiced and unvoiced speech in the plot.
9. Plot the spectrogram of a sound effect that has distinctive frequency components, e.g. a
bird sound, a chainsaw, a car starting, clock strike. Pick an appropriate window length for
the frequency components to be clearly displayed. Make sure to mention the window
length in the title of the plot
10. Plot the spectrogram of a short drum loop of your choice with N=256 and N=2048 to show
that the shorter window means a better time resolution, and thus, the points in time of the
drum hits are easier to discern. Point this out in the figure in its caption.
Sources of sound files

Use Audacity to shorten the sound file to the length required.

Note: the Java and processing code only handles 16-but mono wav files properly. If your file is not
in that format just use Audacity to split a stereo track to mono and export it as a 16-bit wav         

Delphi 12.3 作为一款面向 Windows 平台的集成开发环境,由 Embarcadero Technologies 负责其持续演进。该环境以 Object Pascal 语言为核心,并依托 Visual Component Library(VCL)框架,广泛应用于各类桌面软件、数据库系统及企业级解决方案的开发。在此生态中,Excel4Delphi 作为一个重要的社区开源项目,致力于搭建 Delphi 与 Microsoft Excel 之间的高效桥梁,使开发者能够在自研程序中直接调用 Excel 的文档处理、工作表管理、单元格操作及宏执行等功能。 该项目以库文件与组件包的形式提供,开发者将其集成至 Delphi 工程后,即可通过封装良好的接口实现对 Excel 的编程控制。具体功能涵盖创建与编辑工作簿、格式化单元格、批量导入导出数据,乃至执行内置公式与宏指令等高级操作。这一机制显著降低了在财务分析、报表自动生成、数据整理等场景中实现 Excel 功能集成的技术门槛,使开发者无需深入掌握 COM 编程或 Excel 底层 API 即可完成复杂任务。 使用 Excel4Delphi 需具备基础的 Delphi 编程知识,并对 Excel 对象模型有一定理解。实践中需注意不同 Excel 版本间的兼容性,并严格遵循项目文档进行环境配置与依赖部署。此外,操作过程中应遵循文件访问的最佳实践,例如确保目标文件未被独占锁定,并实施完整的异常处理机制,以防数据损毁或程序意外中断。 该项目的持续维护依赖于 Delphi 开发者社区的集体贡献,通过定期更新以适配新版开发环境与 Office 套件,并修复已发现的问题。对于需要深度融合 Excel 功能的 Delphi 应用而言,Excel4Delphi 提供了经过充分测试的可靠代码基础,使开发团队能更专注于业务逻辑与用户体验的优化,从而提升整体开发效率与软件质量。 资源来源于网络分享,仅用于学习交流使用,请勿用于商业,如有侵权请联系我删除!
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值