Construction and application of English-Chinese interpretation corpus based on big data

Abstract

Interpreting teaching and research require large amounts of authentic, high-quality interpreting data, but existing interpreting corpora have many shortcomings, such as small scale, limited variety, and uneven quality. In this paper, we use big data technology to build a powerful, easy-to-use, and openly shared English-Chinese interpreting corpus that provides rich and diverse high-quality interpreting examples for interpreting teaching and research. We collect English-Chinese interpreting data of various types, scenarios, topics, and levels from the Internet, TV broadcasts, and other channels; clean, standardize, segment, align, and annotate the data; store the metadata in XML format; and design and implement the structure, functions, and interfaces of the corpus. This paper describes the data methods, model construction, and application results of the corpus, covering its collection, organization, annotation, storage, management, retrieval, analysis, display, and application.
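
As an illustration of the XML metadata storage step mentioned above, the following is a minimal sketch in Python using the standard library's `xml.etree.ElementTree`. The element and attribute names (`segment`, `timing`, `source`, `target`, `topic`) and the helper functions are assumptions made for this example; the abstract does not specify the corpus schema.

```python
import xml.etree.ElementTree as ET


def build_segment_metadata(segment_id, source_text, target_text,
                           start_s, end_s, topic):
    """Build one aligned English-Chinese segment as an XML element.

    The schema here is hypothetical, not the schema used by the corpus.
    """
    seg = ET.Element("segment", attrib={"id": segment_id, "topic": topic})
    # Time offsets (in seconds) of the segment within the recording.
    ET.SubElement(seg, "timing", attrib={
        "start": f"{start_s:.2f}",
        "end": f"{end_s:.2f}",
    })
    # Source utterance (English) and its interpretation (Chinese).
    src = ET.SubElement(seg, "source", attrib={"lang": "en"})
    src.text = source_text
    tgt = ET.SubElement(seg, "target", attrib={"lang": "zh"})
    tgt.text = target_text
    return seg


def to_xml_string(element):
    """Serialize an element to a Unicode XML string."""
    return ET.tostring(element, encoding="unicode")


if __name__ == "__main__":
    segment = build_segment_metadata(
        "seg-0001", "Good morning, everyone.", "大家早上好。",
        0.0, 2.35, "conference",
    )
    print(to_xml_string(segment))
```

Keeping the source text, target text, and timing in one element keeps each aligned pair self-contained, which makes later retrieval by topic or time range straightforward.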

Keywords

Big data, English-Chinese interpreting, corpus, construction and application

