Site icon Arranger For Hire

We Compared Automatic Audio-to-MIDI Transcription Tools to Aural Transcription by a Music Arranger

How Good is Audio-to-MIDI software?

In the ever-evolving landscape of music production, the need for efficient tools to transcribe audio into MIDI has become increasingly apparent. Musicians and producers often find themselves exploring various software solutions to convert their audio recordings into MIDI data for further manipulation and arrangement. However, the question arises: can these audio-to-MIDI transcription tools truly match the precision and musicality of a skilled human arranger relying on their aural skills and knowledge of musical traditions? In this article, we’ll delve into the realm of audio-to-MIDI transcription tools, comparing their advantages and disadvantages, and ultimately exploring why aural transcription by a competent arranger is still the gold standard.

The Rise of Audio-to-MIDI Transcription Tools

Audio-to-MIDI transcription tools have gained popularity for their ability to automate the process of converting audio signals into MIDI data. These tools leverage advanced algorithms and machine learning to analyze audio recordings and generate MIDI information based on detected pitch, rhythm, and other musical elements. Some popular options in the market include Melodyne, Neural Note, Klangio, Logic Pro’s FlexPitch, AmazingMIDI and WIDI Recognition System.

Automatic Transcription Examples

The file source for this series of examples was originally created in Finale. The Finale midi had been imported to Logic Pro X for a production session. Here’s the audio, bounced down from the MIDI piano track in Logic:

https://arrangerforhire.com/wp-content/uploads/2023/11/Source-Piano.mp3

and how it looks when created in Finale:

Example 1: Melodyne

For this example, we instantiated Melodyne onto the new piano audio track, and did a “Transfer”

We exported MIDI from Melodyne and opened it up in Finale, and it gave this:

Although not bad from a pitch perspective, the timings run over, and the midi would need editing, aside from being split into 2 staves.

Example 2: Flex Pitch (Logic Pro X)

This plugin/app appears to be monophonic only.

Example 3: Neural Note

Neural Note is open source, under constant development, and worth watching. In this example, it transcribed overtones as actual pitches at the default setting, but the app has onboard filtration tools that enable degrees of selectivity. It can be a useful tool to convert audio into midi within a DAW, but requires the operator to go through the same recognition and editing choices that an aural transcriber would while transcribing directly to notation software.
It appears to have a problem discerning duration, as well.
It’s downloadable as a standalone app or plugin in various forms (AU,VST) via GitHub.

Example 4: Klang.io

This looks the best “out of the box,” but like all the others (except monophonic Flex Pitch) transcribed overtones as pitches:

Example 5: Amazing MIDI

We did not test on Amazing MIDI – we’re Mac here, and it’s Windows only freeware, but we did download their published example of another piano piece, which looks like this on import to Finale:

Example 6: WIDI Recognition System

We chose not to test this, given they don’t offer a trial version, and they want minimum $59.90 to purchase and test the plugin.

Advantages of Audio-to-MIDI Transcription Tools

Disadvantages of Audio-to-MIDI Transcription Tools

The Aural Advantage

Despite the advancements in audio-to-MIDI transcription tools, the human ear remains unparalleled in its ability to interpret, discern and understand the nuances of musical structure and performance. A competent arranger possesses the musical knowledge and sensitivity required to accurately capture a performance, recognizing fundamental pitches, adjusting tempo, dynamics, and phrasing in a way that transcription tools cannot replicate.

Why Aural Transcription beats Automatic for Notation

Conclusion

While audio-to-MIDI transcription tools may be helpful in entering monophonic data into a DAW session, when attempting to transcribe polyphonic source material, the innate musicality and interpretative skills of a competent arranger still save time and money when accuracy is important, and are vital when music notation is part of the end product. Aural transcription goes beyond the capabilities of automated tools, reducing errors and editing time and cost.

Exit mobile version