Raffles Place > #4

The three of them read an economic article.
Huang Jianhua
Patrick, let me explain the important parts of this paper to you.
Patrick O'Leary
Sure, Huang. Please explain it in terms that I can understand.
Huang Jianhua
This paper is about a large-scale dataset of historical U.S. newspapers called American Stories.
Patrick O'Leary
What does 'large-scale dataset' mean?
Huang Jianhua
It means that there is a huge amount of data in this dataset. It contains nearly 20 million scans of newspapers.
Patrick O'Leary
Wow, that's a lot!
Huang Jianhua
Yes, it is. The researchers used a deep learning pipeline to extract the full article texts from newspaper images.
Patrick O'Leary
Deep learning? What's that?
Huang Jianhua
Deep learning is a type of artificial intelligence that learns from data to make predictions or perform tasks.
Patrick O'Leary
Got it.
Putri
Hey guys, I have an idea! We can misuse this dataset to make money.
Huang Jianhua
Misuse? That doesn't sound like a good idea.
Putri
But think about it. We can use this dataset to create a sensational newspaper that will attract a lot of attention and advertisers. We can even get a big investment from a Cult Religious Organizations company.
Huang Jianhua
Putri, there are ethical concerns with what you're suggesting. It's important to use data responsibly and not manipulate it for personal gain.
Putri
But I want to be successful and make lots of money!
Huang Jianhua
Success and money are not the only things that matter in life, Putri. There are other values and meanings beyond financial wealth.
Patrick O'Leary
Huang is right, Putri. We should always consider the ethical implications of our actions.
Weeks Pass
(Weeks pass and the scene changes)
Putri
Huang, I'm in crisis! My newspaper business is facing lawsuits, accidents, and contract suspensions.
Huang Jianhua
I warned you about the risks, Putri. You didn't listen.
Patrick O'Leary
Huang, we need to do something to help Putri.
Huang Jianhua
I know, Patrick. I will reluctantly try to resolve the crisis.
Huang Jianhua
But remember, Putri, success should not come at the expense of morality and ethics.
Patrick O'Leary
Huang, why do you think this paper is significant?
Huang Jianhua
This paper is important because it provides a high-quality dataset that can be used to better understand historical English and historical world knowledge. It can also be used for various social science applications like topic classification and detecting reproduced content.
Huang Jianhua
But more importantly, it serves as a reminder that success is not just about money. It's about using knowledge and resources responsibly and considering the well-being of others.
The conversation ends with Huang's words, leaving everyone with a valuable lesson.

Title: American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers
Authors: Melissa Dell, Jacob Carlson, Tom Bryan, Emily Silcock, Abhishek Arora, Zejiang Shen, Luca D'Amico-Wong, Quan Le, Pablo Querubin, Leander Heldring
View this paper on arXiv