CJK computer science terms comparison / 中日韓電腦科學術語對照 / 日中韓のコンピュータ科学の用語対照 / 한·중·일 전산학 용어 대조

Overview

CJK computer science terms comparison

GitHub Netlify Creative Commons License

This repository contains the source code of the website. You can see the website from the following link:

Greater China, Japan, and Korea, the so-called Sinosphere (漢字文化圈; literally: "Chinese character cultural sphere"), have borrowed many concepts through Sinoxenic vocabularies from the West since the modern era. Some of them have their own translations, but some have imported translations from neighboring countries. In some translations, both native and foreign stems are combined. As a result, Sinosphere countries share a lot of words, but to some extent they have their own parts. And this is no different in computer science translations.

This page contains comparison tables of how computer science terms, mostly derived from English, are translated and called in different regions of Sinosphere.

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Introduction

Cognates

Cognates are words that are derived from one side or share a common etymology.

For example, the English word computer and the Korean word 컴퓨터, the Japanese word 計算科学 (keisan kagaku) and the Chinese word 計算科學 (jìsuàn kēxué), that are both meaning computational science, are cognates.

Cognates are indicated by the same colored border.

Calque (loan translation)

Calque is a word or phrase borrowed from another language by literal word-for-word or root-for-root translation.

For example, the Chinese word 軟件 is a translation of the English word software, which translates the English words soft (ruǎn; soft or flexible) and ware (jiàn; clothes or item) respectively.

Matching words/roots between languages in this way are underlined with the same color & shape.

Homophonic translations

For a root transcribed from a foreign word, the original word is displayed on the root.

For example, as the Japanese word コンピュータ (konpyu-ta) is a transcription of English word computer, it is displayed like: コンピュータcomputer.

Romanized pronunciation

The pronunciation of each word is shown in Latin letters in parentheses below the word. The transcription system for each language is as follows:

Mandarin (China & Taiwan) : Hanyu Pinyin

Cantonese (Hong Kong) : Jyutping (Linguistic Society of Hong Kong Cantonese Romanization Scheme)

Japanese : Hepburn romanization

Korean : Revised Romanization of Korean

Basic terms

Show table.

Units

Show table.

Fields of study

Show table.

Computer programming

Show table.

Tools

Show table.

Theory of computation

Show table.

*[CJK]: Chinese, Japanese, and Korean languages

Comments
  • Fix explanation for ‘function’

    Fix explanation for ‘function’

    1. ja 関数 cognates with zh-HK 函數.

    Wiktionary: 日本語としての関数はもともと「函数」と書く。これは英語 "function" の中国における訳語である函数 (hánshù) をそのまま輸入したものであるが、「函」が漢字制限による常用漢字に含まれなかったため1950年代以降、同音の「関」へと書き換えがすすめられた。

    Besides, ‘function’ appears in another part of this project, and that part is correct: https://github.com/dahlia/cjk-compsci-terms/blob/1fcbf1daad0912b78653148a496483dd45e8c412/tables/paradigms.yaml#L146-L155

    2. 式 in zh-TW means ‘style’, but is mistakenly written as ‘number’.

    承上而誤

    opened by ayaka14732 3
  • 'data': add 정보(情報)

    'data': add 정보(情報)

    Maybe a native speaker can review this but I think 자료 is more like 'material' whereas 정보 is more like 'information'. Both are a sensible translation of 'data'.

    opened by graup 2
  • Add languages for ToC-generated links

    Add languages for ToC-generated links

    For non-English pages, the ToC extension generates its links as <a href="#{contents}">. When users click them, it routes to the English page. This commit will make them as <a href="{lang}#{contents}">.

    opened by SeokminHong 1
  • Dark mode / 深色模式

    Dark mode / 深色模式

    The current style has the only color scheme with white background. It would be good to have a more color scheme with black background as well for ones who love dark/night mode.

    As the existing color scheme already depends on Open Color, choosing a set of colors for the new color scheme would be not that difficult.

    See also the prefers-color-scheme CSS media feature.

    help wanted / 徵求幫助 good first issue / 好第一個議題 
    opened by dahlia 0
  • Vietnamese (`vi`) terms

    Vietnamese (`vi`) terms

    Although I am totally ignorant about Vietnamese, I would appreciate if someone help me.

    One thing to consider is that it would require the repository name and the website domain name to be renamed from cjk-compsi-terms to cjkv-compsi-terms, but I'm going to do all of bits and pieces for it if we finally manage it.

    enhancement / 改進 help wanted / 徵求幫助 
    opened by dahlia 0
  • North Korean (`ko-KP`) terms

    North Korean (`ko-KP`) terms

    If there are enough resources about North Korean terms we could add a new column for North Korean (ko-KP). This issue tracks the useful resources for that.

    See also my tweet about this research. Please report more resources.

    enhancement / 改進 help wanted / 徵求幫助 
    opened by dahlia 0
  • Regional variant characters should be marked

    Regional variant characters should be marked

    Sometimes, an existing character can be a regional variant of another character due to its coincidental reading in a region. However, often such regional variants cannot convey the intention in other regions.

    For example, 才 (talent) is used often instead of 歳/歲 (age) in Japan due to its coincidental reading in Japanese: さい (sai). However, those two characters don't share the same reading in other regions, e.g., 才 is 재 (jae) & 歲 is 세 (se) in Korean.

    If such regional variants replaced orthodox characters in a regional standard they should be normalized to orthodox ones in keys to group cognates, whereas their readings from other languages should be read as variant characters which aren't normalized.

    For example, since 函 (box) is unlisted in Japanese jōyō kanji (常用漢字), 函 (かん; kan) is replaced by 関/關 (relation) which has the same reading. That's why 函數 (function) is written as 関数/關數 in Japanese. Still, as those two characters are read differently in other regions (e.g., 함 ham vs 관 guan in Korean), such uses need to be marked in the data set and pronunciations in <ruby> should show the proper reading (e.g., 함 instead of 관).

    See also: https://github.com/dahlia/cjk-compsci-terms/pull/9.

    bug / 錯誤 
    opened by dahlia 0
  • Concurrent programming / 倂發計算

    Concurrent programming / 倂發計算

    • [x] Concurrent programming
    • [ ] Parallel computing
    • [x] Concurrency
    • [x] Lock
    • [x] Synchronization
    • [ ] Asynchronous programming
    • [ ] Synchronous programming
    • [x] Transactional memory
    • [ ] Multi-core

    Thanks @item4 & @disjukr for their suggestions.

    enhancement / 改進 
    opened by dahlia 0
Owner
Hong Minhee (洪 民憙)
A software engineer from Seoul. An advocate of F/OSS, Open Web, and Cypherpunk. Hack into East Asian languages.
Hong Minhee (洪 民憙)
Facilitating the design, comparison and sharing of deep text matching models.

MatchZoo Facilitating the design, comparison and sharing of deep text matching models. MatchZoo 是一个通用的文本匹配工具包,它旨在方便大家快速的实现、比较、以及分享最新的深度文本匹配模型。 ?? News

Neural Text Matching Community 3.7k Jan 2, 2023
Facilitating the design, comparison and sharing of deep text matching models.

MatchZoo Facilitating the design, comparison and sharing of deep text matching models. MatchZoo 是一个通用的文本匹配工具包,它旨在方便大家快速的实现、比较、以及分享最新的深度文本匹配模型。 ?? News

Neural Text Matching Community 3.4k Feb 18, 2021
Tools, wrappers, etc... for data science with a concentration on text processing

Rosetta Tools for data science with a focus on text processing. Focuses on "medium data", i.e. data too big to fit into memory but too small to necess

null 207 Nov 22, 2022
Artificial Conversational Entity for queries in Eulogio "Amang" Rodriguez Institute of Science and Technology (EARIST)

?? Coeus - EARIST A.C.E ?? Coeus is an Artificial Conversational Entity for queries in Eulogio "Amang" Rodriguez Institute of Science and Technology,

Dids Irwyn Reyes 3 Oct 14, 2022
Using context-free grammar formalism to parse English sentences to determine their structure to help computer to better understand the meaning of the sentence.

Sentance Parser Executing the Program Make sure Python 3.6+ is installed. Install requirements $ pip install requirements.txt Run the program:

Vaibhaw 12 Sep 28, 2022
The ability of computer software to identify words and phrases in spoken language and convert them to human-readable text

speech-recognition-py Speech recognition is the ability of computer software to identify words and phrases in spoken language and convert them to huma

Deepangshi 1 Apr 3, 2022
Stuff related to Ben Eater's 8bit breadboard computer

8bit breadboard computer simulator This is an assembler + simulator/emulator of Ben Eater's 8bit breadboard computer. For a version with its RAM upgra

Marijn van Vliet 29 Dec 29, 2022
metedraw is a project mainly for data visualization projects of Atmospheric Science, Marine Science, Environmental Science or other majors

It is mainly for data visualization projects of Atmospheric Science, Marine Science, Environmental Science or other majors.

Nephele 11 Jul 5, 2022
PRAnCER is a web platform that enables the rapid annotation of medical terms within clinical notes.

PRAnCER (Platform enabling Rapid Annotation for Clinical Entity Recognition) is a web platform that enables the rapid annotation of medical terms within clinical notes. A user can highlight spans of text and quickly map them to concepts in large vocabularies within a single, intuitive platform.

Sontag Lab 39 Nov 14, 2022
The coda and data for "Measuring Fine-Grained Domain Relevance of Terms: A Hierarchical Core-Fringe Approach" (ACL '21)

We propose a hierarchical core-fringe learning framework to measure fine-grained domain relevance of terms – the degree that a term is relevant to a broad (e.g., computer science) or narrow (e.g., deep learning) domain.

Jie Huang 14 Oct 21, 2022
Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Context Terms

LESA Introduction This repository contains the official implementation of Locally Enhanced Self-Attention: Rethinking Self-Attention as Local and Cont

Chenglin Yang 20 Dec 31, 2021
Simple yet efficient tool used to check and sort tokens in terms of there validation.

Discord Token Checker Simple yet efficient tool used to check and sort tokens in terms of there validation.When the program is done,go to the "output"

Robotnik 15 Dec 27, 2022
The blancmange curve can be visually built up out of triangle wave functions if the infinite sum is approximated by finite sums of the first few terms.

Blancmange-curve The blancmange curve can be visually built up out of triangle wave functions if the infinite sum is approximated by finite sums of th

Shankar Mahadevan L 1 Nov 30, 2021
Search for terms(word / table / field name or any) under Snowflake schema names

snowflake-search-terms-in-ddl-views Search for terms(word / table / field name or any) under Snowflake schema names Version : 1.0v How to use ? Run th

Igal Emona 1 Dec 15, 2021
Implemented a Google Maps prototype that provides the shortest route in terms of distance

Implemented a Google Maps prototype that provides the shortest route in terms of distance, the fastest route, the route with the fewest turns, and a scenic route that avoids roads when provided a source and destination. The algorithms used were DFS, BFS, A*, and Iterative Depth First Search.

null 1 Dec 26, 2021
Reggy - Regressions with arbitrarily complex regularization terms

reggy Regressions with arbitrarily complex regularization terms. Currently suppo

Kim 1 Jan 20, 2022
Bachelor's Thesis in Computer Science: Privacy-Preserving Federated Learning Applied to Decentralized Data

federated is the source code for the Bachelor's Thesis Privacy-Preserving Federated Learning Applied to Decentralized Data (Spring 2021, NTNU) Federat

Dilawar Mahmood 25 Nov 30, 2022
collection of interesting Computer Science resources

collection of interesting Computer Science resources

Kirill Bobyrev 137 Dec 22, 2022
These data visualizations were created for my introductory computer science course using Python

Homework 2: Matplotlib and Data Visualization Overview These data visualizations were created for my introductory computer science course using Python

Sophia Huang 12 Oct 20, 2022