Sitemap

A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.

Posts

Future Blog Post

less than 1 minute read

Published: January 01, 2199

This post will show up by default. To disable scheduling of future posts, edit config.yml and set future: false.

Blog Post number 4

less than 1 minute read

Published: August 14, 2015

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 3

less than 1 minute read

Published: August 14, 2014

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 2

less than 1 minute read

Published: August 14, 2013

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

Blog Post number 1

less than 1 minute read

Published: August 14, 2012

This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.

portfolio

Portfolio item number 1

Short description of portfolio item number 1

Portfolio item number 2

Short description of portfolio item number 2

publications

基于平行语料库的Tess of the D’Urbervilles三译本四字格的对比研究

Published in 翻译研究与教学, 2022

基于自建的Tess of the D’Urbervilles三译本平行语料库，本研究旨在考察译本间不同的风格及四字格的使用情况。

Recommended citation: 孔德璐, 张继东. (2022). "基于平行语料库的Tess of the D’Urbervilles三译本四字格的对比研究." 翻译研究与教学. 2(2).
Download Paper

机器学习视域下《三国演义》三译本翻译风格对比研究

Published in 大连大学学报, 2023

本文使用机器学习中的分类和聚类方法，对中国四大名著之一的《三国演义》三译本进行翻译风格考察。

Recommended citation: 孔德璐. (2023). "机器学习视域下《三国演义》三译本翻译风格对比研究." 大连大学学报. 44(4).
Download Paper

基于机器学习方法的《德伯家的苔丝》中文译本翻译风格考察

Published in 数字人文研究, 2024

研究使用机器学习中的分类和聚类方法,基于自建平行语料库,考察哈代名作《德伯家的苔丝》中文三译本的翻译风格。

Recommended citation: 孔德璐. (2024). "基于机器学习方法的《德伯家的苔丝》中文译本翻译风格考察." 数字人文研究. 4(1).
Download Paper

ChatGPT人机对话式翻译研究

Published in 上海理工大学学报（社会科学版）, 2024

聚焦ChatGPT人机对话式翻译，分析其译文特征并通过与以谷歌翻译为代表的神经机器翻译质量对比发现二者差距。

Recommended citation: 李梅, 孔德璐. (2024). "ChatGPT人机对话式翻译研究." 上海理工大学学报（社会科学版）.
Download Paper

机器学习视域下政经语篇翻译风格对比研究——以《国富论》中文三译本为例

Published in 北京翻译, 2024

本研究采用机器学习算法，探索政经语篇翻译风格特征研究的新路径。

Recommended citation: 孔德璐. (2024). "机器学习视域下政经语篇翻译风格对比研究——以《国富论》中文三译本为例." 北京翻译. 2.
Download Paper

新质生产力背景下中国网络文学的国际传播与外译挑战——以《斗破苍穹》为例

Published in 翻译与传播, 2024

本文以《斗破苍穹》为个案，探讨中国网络文学在“新质生产力”背景下的国际传播与外译挑战，分析其全球化路径、文化适配过程及未来融合新兴技术的发展可能性。

Recommended citation: 孔德璐, 李宝虎, 何婷. (2024). "新质生产力背景下中国网络文学的国际传播与外译挑战——以《斗破苍穹》为例." 翻译与传播, (2): 63–74.
Download Paper

基于机器学习的政经语篇人机翻译风格研究——以《国富论》中译本为例

Published in 外语导刊, 2025

本文利用机器学习算法，基于自建《国富论》人机多译本平行语料库，通过分类、聚类、特征选择实验，考察人类专家译本与大语言模型机器译本的风格差异。

Recommended citation: 孔德璐. (2025). "基于机器学习的政经语篇人机翻译风格研究——以《国富论》中译本为例." 外语导刊. 48(1).
Download Paper

Decoding Machine Translationese in English-Chinese News: LLMs vs. NMTs

Published in MT Summit 2025 (Technical Track), arXiv preprint, 2025

This study explores Machine Translationese (MTese) in English-to-Chinese news translation using classification and clustering with a five-layer feature set.

Recommended citation: Kong, D., & Macken, L. (2025). "Decoding Machine Translationese in English-Chinese News: LLMs vs. NMTs." MT Summit 2025 (Technical Track), arXiv preprint arXiv:2506.22050.
Download Paper

Can Peter Pan Survive MT? A Stylometric Study of LLMs, NMTs, and HTs in Children’s Literature Translation

Published in CTT Workshop @ MT Summit 2025, arXiv preprint, 2025

A stylometric study of machine vs. human translations of Peter Pan using 447 linguistic features across generic and creative-text dimensions.

Recommended citation: Kong, D., & Macken, L. (2025). "Can Peter Pan Survive MT? A Stylometric Study of LLMs, NMTs, and HTs in Children Literature Translation." CTT Workshop @ MT Summit 2025, arXiv preprint arXiv:2506.22038.
Download Paper

talks

Can ChatGPT Break Through the ‘Fortress’ of Literary Translation? A Computational Stylistic Analysis of Human vs. Machine Translation

Published: October 25, 2023

I went to this conference under the scholarship from Tongji University. Thanks!

Can ChatGPT Break Through the ‘Bastion’ of Literary Translation? A Stylometric Study of Human vs. Machine Translation Variants

Published: March 22, 2024

Research Design:

Comparative analysis of 7 human translations (HT) vs. 14 machine translations (MT)
Target texts: Two canonical literary essays
Methodology: Text similarity algorithms for stylometric measurement

Detecting Language Features of ‘Machine-Translationese’ through Machine Learning: Original Chinese News Text vs. Machine-Translated Chinese News Text

Published: June 13, 2024

Abstract:
This presentation investigates the linguistic features of “Machine-Translationese” (MTese) in English-to-Chinese machine-translated news texts. Through machine learning approaches, we identify and quantify distinctive characteristics that differentiate machine-translated Chinese news texts from originally authored Chinese news texts. Our methodology employs clustering techniques to visualize these differentiating features, revealing systematic patterns in MTese.

Comparative Stylistic Analysis of Human vs. Machine Translation in Economic Discourse: A Machine Learning Approach to ‘The Wealth of Nations’ E-C Translations

Published: July 01, 2024

I attended this conference under the scholarship of Tongji-IPRRFSS. Thanks!

Can Peter Pan Survive MT? A Stylometric Study of LLMs, NMTs, and HTs in Children’s Literature Translation

Published: June 27, 2025

This study evaluates the performance of machine translations (MTs) versus human translations (HTs) in English-to-Chinese children’s literature translation (CLT) from a stylometric perspective. A Peter Pan corpus was constructed, including 21 translations: 7 HTs, 7 large language model translations (LLMs), and 7 neural machine translation outputs (NMTs). The analysis uses both a generic feature set (lexical, syntactic, readability, n-gram features) and a creative text translation (CTT-specific) feature set (repetition, rhythm, translatability, miscellaneous), totaling 447 linguistic features.

Stylometric analysis using classification and clustering techniques reveals that, for generic features, HTs and MTs differ significantly in conjunction word distributions and the ratio of 1-word-gram-YiYang. NMTs and LLMs differ in descriptive word usage and adverb ratios. For CTT-specific features, LLMs outperform NMTs in distribution and align more closely with HTs in stylistic characteristics, highlighting the potential of LLMs in CLT.

Comments: 19 pages, 8 figures, 4 tables. Accepted at the 2nd Workshop on Creative-text Translation and Technology, co-located with MT Summit 2025. Official paper may later be accessed from ACL Anthology.

PPT: Link

Decoding Machine Translationese in English-Chinese News: LLMs vs. NMTs

Published: June 28, 2025

This study explores Machine Translationese (MTese)—the linguistic peculiarities of machine translation outputs—focusing on the under-researched English-to-Chinese language pair in news texts. We construct a large dataset consisting of 4 sub-corpora and employ a comprehensive five-layer feature set. A chi-square ranking algorithm is applied for feature selection in both classification and clustering tasks. Our findings confirm the presence of MTese in both Neural Machine Translation systems (NMTs) and Large Language Models (LLMs). Original Chinese texts are nearly perfectly distinguishable from both LLM and NMT outputs. Notable linguistic patterns in MT outputs are shorter sentence lengths and increased use of adversative conjunctions. Comparing LLMs and NMTs, we achieve approximately 70% classification accuracy, with LLMs exhibiting greater lexical diversity and NMTs using more brackets. Additionally, translation-specific LLMs show lower lexical diversity but higher usage of causal conjunctions compared to generic LLMs. Lastly, we find no significant differences between LLMs developed by Chinese firms and their foreign counterparts.

Comments: 14 pages, 5 figures, 6 tables. Accepted in MT Summit 2025, Research: Technical track. Official version may be accessed later in the ACL Anthology.

PPT: Download here

teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015

This is a description of a teaching experience. You can use markdown like any other post.

Shijie Liu

Sitemap

Pages

Posts

portfolio

publications

talks

teaching