Corticokinematic coherence (CKC) between magnetoencephalographic signals and movement signals recorded with an accelerometer is useful for the functional localization of the primary sensorimotor cortex (SM1). However, the tongue CKC is difficult to determine because an accelerometer produces excessive magnetic artifacts. Here, we introduce a novel approach for measuring the tongue CKC using a deep learning-assisted motion capture system with videography, and compare it with an accelerometer in a control task measuring finger movement. Twelve healthy volunteers performed rhythmical side-to-side tongue movements in a whole-head magnetoencephalographic system; the movements were simultaneously recorded using a video camera and analyzed using the deep learning-assisted motion capture system. In the control task, the right finger CKC was measured simultaneously with motion capture and an accelerometer. The right finger CKC with motion capture was significant at the movement frequency peak or its harmonics over the contralateral hemisphere; the motion-captured CKC was 84.9% similar to that with the accelerometer. The tongue CKC was significant at the movement frequency peak or its harmonics over both hemispheres. The CKC sources of the tongue were considerably lateral and inferior to those of the finger. Thus, CKC with deep learning-assisted motion capture can be used to evaluate the functional localization of the tongue SM1.
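
For readers unfamiliar with the measure, the following is a minimal sketch of how coherence between a movement trace and a single cortical channel could be estimated at the movement frequency. It is not the analysis pipeline of this study, which the abstract does not specify. It uses a generic segment-averaged (Welch-style) magnitude-squared coherence, and every signal, name, and parameter in it (`FS`, `MOVE_HZ`, `SEG_LEN`, the synthetic `movement` and `meg` traces) is a hypothetical stand-in chosen for illustration.

```python
import cmath
import math
import random


def dft_bin(segment, k):
    """Return DFT coefficient k of a real-valued segment."""
    n = len(segment)
    return sum(v * cmath.exp(-2j * math.pi * k * i / n)
               for i, v in enumerate(segment))


def coherence_at_bin(x, y, seg_len, k):
    """Magnitude-squared coherence of x and y at DFT bin k,
    averaged over non-overlapping Hann-windowed segments."""
    window = [0.5 - 0.5 * math.cos(2 * math.pi * i / seg_len)
              for i in range(seg_len)]
    sxx = syy = 0.0   # accumulated auto-spectra
    sxy = 0j          # accumulated cross-spectrum
    for start in range(0, len(x) - seg_len + 1, seg_len):
        xs = [w * v for w, v in zip(window, x[start:start + seg_len])]
        ys = [w * v for w, v in zip(window, y[start:start + seg_len])]
        fx, fy = dft_bin(xs, k), dft_bin(ys, k)
        sxx += abs(fx) ** 2
        syy += abs(fy) ** 2
        sxy += fx * fy.conjugate()
    return abs(sxy) ** 2 / (sxx * syy)


# Synthetic stand-ins (assumptions, not data from the study).
random.seed(0)
FS = 100        # sampling rate in Hz
MOVE_HZ = 2.0   # rhythmic movement frequency in Hz
SEG_LEN = 200   # 2 s segments, giving 0.5 Hz frequency resolution
N = 4000        # 40 s of data, i.e. 20 segments

t = [i / FS for i in range(N)]
# Movement trace, e.g. a tracked marker coordinate or accelerometer axis.
movement = [math.sin(2 * math.pi * MOVE_HZ * ti) + random.gauss(0, 0.1)
            for ti in t]
# Cortical channel: weak phase-shifted copy of the rhythm buried in noise.
meg = [0.5 * math.sin(2 * math.pi * MOVE_HZ * ti + 0.8) + random.gauss(0, 1.0)
       for ti in t]

bin_move = round(MOVE_HZ * SEG_LEN / FS)  # DFT bin of the movement frequency
bin_ctrl = round(7.0 * SEG_LEN / FS)      # unrelated control frequency (7 Hz)

coh_movement = coherence_at_bin(movement, meg, SEG_LEN, bin_move)
coh_control = coherence_at_bin(movement, meg, SEG_LEN, bin_ctrl)
print(f"coherence at movement frequency: {coh_movement:.2f}")
print(f"coherence at control frequency:  {coh_control:.2f}")
```

In this toy setting the coherence should be clearly higher at the movement frequency than at the unrelated control frequency. A real analysis would additionally need a significance threshold and source localization, neither of which is sketched here.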