Decibel Analysis for Research in Teaching (DART) is a machine-learning-based algorithm that can analyze audio recorded class sessions with minimal costs and without need for human observers for measuring the use of teaching strategies beyond traditional lecture in undergraduate STEM courses. DART characterizes class session noise levels into three different modes: single voice, multiple voice, and no voice. Table 1 below shows common activities that are typically associated with each of these three modes. Based on our testing, DART makes an accurate prediction 87.5% of the time and tends to underestimate the amount of time spent in multiple and no voice. More on the types of errors DART makes in the section, What kinds of errors does DART make.
Predicted DART Mode | Human Annotation Code |
---|---|
Single Voice | Lecture: Instructor or other individual speaking to class as a whole |
Question/Answer: Individual student or instructor responding to class as a whole | |
Video: Any audio/video clip played while students are sitting and watching/listening | |
Multiple Voice | Discussion: Small groups of students talking with multiple voices talking in the classroom at once |
Transition: Breaks, movement around classroom, time between activities when students might chat or make loud movements | |
No Voice | Silent: Students working individually (no one talking) such as a minute paper, think portion of a think pair share, quiz |
Active learning pedagogies of varying quality have been repeatedly demonstrated to produce superior learning gains with large effect sizes compared to lecture-based pedagogies. However, it is unclear what proportion of STEM instructors in the U.S. and internationally regularly use teaching strategies beyond lecture. DART meets this need for a measurement approach that could systematically inventory the presence of active learning not only in one course but also across dozens of departmental courses, multiple STEM departments, and thousands of colleges and universities.
We are an interdisciplinary team drawn from more than two-dozen San Francisco Bay Area community colleges and universities led by the Science Education Partnership and Assessment Laboratory (SEPAL), the Center for Computing in Life Sciences (CCLS) at San Francisco State University, and research collaborators at Pacific Lutheran University. In total, the research team includes over 70 collaborators!
While we developed DART to analyze noise patterns in college science courses, we can think of many other applications for this tool. If you are an individual instructor, a K-12 teacher, a faculty developer, a program coordinator, or in another position in which you think DART might be useful to you, we would love to hear about it! Please use the button below to contact us and tell us how DART might be useful in your context.
Your DART Report will include three main parts:
DART makes a conservative estimate of the amount of time spent in modes Multiple Voice and No Voice, the two DART modes that correlate most directly with student-centered or active learning activities. The pattern of errors DART makes for each mode showing correct inclusions (hits), incorrect exclusions (misses), correct exclusions (correct rejections), and incorrect inclusions (false alarms), can be seen in Figure 1 below.
More specifically, below we list some potential coding misclassifications that you may encounter:
Examples of diverse classroom situations | Human annotation code | Expected DART mode based on annotation | Actual DART prediction | |
---|---|---|---|---|
(A) False Negative for Single Voice | Recorder is too far away from individual talking | Lecture with Q/A | Single Voice | No Voice |
Long pauses where instructor is silent (e.g. while writing on the board or working with equipment) | Lecture with Q/A | Single Voice | No Voice | |
Student closest to audio recorder talking inappropriately while instructor is talking | Lecture with Q/A | Single Voice | Multiple Voice | |
Instructor talking during a video with audio or video with loud music | Video | Single Voice | Multiple Voice | |
Significant ambient or outside noise (e.g. loud fan, outside hall conversations, etc.) during single voice activity | Lecture with Q/A | Single Voice | Multiple Voice | |
(B) False Positive for Single Voice | Silent work in which an instructor or student speaks extraneously | Silent | No Voice | Single Voice |
Significant ambient or outside noise (e.g. loud fan, outside hall conversations, etc.) during silent activity | Silent | No Voice | Single Voice | |
Small group or pair discussions in a very small (e.g., under 4 student) class | Discussion | Multiple Voice | Single Voice | |
Small or pair discussions in which there is delayed/minimal student discussions | Discussion | Multiple Voice | No/Single Voice | |
(C) Errors Concerning Other Modes | Break during class – students remain in classroom | Transition | Multiple Voice | Multiple Voice |
Students left classroom for small group activity | Silent | No Voice | No Voice | |
Choral response to instructor questions (lasting more than 15s) | Lecture with Q/A | Single Voice | Single Voice |
(A) Classroom situations with a human annotation corresponding to Single Voice but DART prediction of Multiple or No Voice. (B) Classroom situations with a human annotation corresponding to Multiple or No Voice but DART prediction of Single Voice. (C) Classroom situations where predicted and actual modes match, but vary based on quality of activity.
This may be an artifact of recorder placement (if the recorder is closer to the instructor than the students) or using pre-recorded audio to run the decibel analysis. Recording devices have a limited range, so samples at the high end and low end of that range will be given similar decibel values.
DART was designed to be a conservative estimate of the amount of time spent in the Multiple Voice and No Voice modes, so errors can occur when a recording is nearly all Multiple Voice (or nearly all No Voice, as during an exam). More specifically, because DART was designed to work with a variety of sound recording equipment and contexts, DART creates its predictions using sound loudness levels and sound variability levels normalized to the average loudness and variability level of the entire class session. If a class session includes a variety of modes, the relative sound patterns of these modes will contrast with each other and have different normalized loudness and variability levels. In contrast, if nearly all the class session is Multiple Voice, there is no contrast and no way for DART to tell the "true" mode. Given that DART was designed to be a conservative estimate of active learning, DART defaults to presuming that such "monomodal" recordings are all in Single Voice.
In a recording of a small class or meeting, one with under 16 participants or so, a human listener can often still pick out the cadence of individual voices during pair or group discussions. As DART was designed to be a conservative estimate of the amount of time spent in the Multiple Voice and No Voice modes, DART tends to predict those times to be Single Voice.
The analysis time depends on the length of audio file as well as your computer’s processing speed and the strength of your internet connection. On a general use laptop computer, a 60 minute audio file will take approximately 5 minutes to complete analysis. We thank you for your patience.
DART accepts files in MP3, MP4, M4A, M4P, OGG, and WAV format.
DART can analyze MP4 and M4P video types. If you have a video file in another format, you can extract the audio from it using any number of different programs online.
The percentages DART calculates are not valid if your recording includes noise from before class, after class, or breaks. Therefore if you'd like percentage data, please clip your file to exclude time before or after class or during breaks. Many free downloadable programs are available that you can use to clip your files, including Audacity and iTunes.
Our early testing indicates that DART will run in Chrome, and Firefox but not Safari. While either works, Chrome is best supported. Browsers on phones and tablets are not supported.
For our initial study of DART, the Sony Digital Voice Recorder ICDPX333 was used. However, other recording devices such as basic video cameras and iPhone Voice Memo have been used to record audio samples with similar results. However, microphones that selectively record sound in a particular direction (such as "cardioid" microphones) may not work as well, because they exclude the background noise. Therefore, while we anticipate that most audio recording devices should give usable DART results, we cannot guarantee it, so we recommend that you try yours out and run DART analysis on it to see if it is effective before doing any long-term recording projects.
In our experience, it was most effective to place the recording device at the front of the classroom (perhaps on a podium or table) with the recording device facing the audience. It is important to note that the recording device must be able to hear both the instructor’s voice and the students’ collective voice.
If you have many files or instructors to analyze, please contact us with the button below to discuss a potential collaboration.
We are doing our best to make DART accessible to everyone who would like to use it. If you are having trouble, make sure you are using the supported browsers, either Firefox or Chrome. You can also first try closing other programs and see if DART works. If you are still having trouble, please share your issue with us by contacting us using the button below. It will be most useful if, to the extent you can, you can share information about what computer and browser you are using and what else you are running. Thank you for your patience as we get this tool up and running!