Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Translation: Library Carpentry OpenRefine Episode 3 #158

Open
joelnitta opened this issue Aug 5, 2021 · 0 comments
Open

Translation: Library Carpentry OpenRefine Episode 3 #158

joelnitta opened this issue Aug 5, 2021 · 0 comments

Comments

@joelnitta
Copy link
Member

Translation: Library Carpentry OpenRefine Episode 3

Original lesson: https://librarycarpentry.org/lc-open-refine/03-working-with-data/index.html


  • PO fileの編集についてはREADMEをご参照ください
  • あなたが翻訳し始めたと分かるように、翻訳を始める前にこのイシューを自分に割り振って(assign)、あるいはコメントを書いて下さい。
  • 翻訳が終わったら、プルリクエストを開いて、このイシューを参照・リンクして下さい。

i18n/po/lc-open-refine.ja.po

Lines 707 to 1007 in bb59a56

#: lc-open-refine/_episodes/03-working-with-data.md:1
# Front Matter
msgid "---\n"
"title: \"Layout of OpenRefine, Rows vs Records\"\n"
"teaching: 10\n"
"exercises: 5\n"
"questions:\n"
"- \"How is data organised in OpenRefine?\"\n"
"- \"How do I access options to amend data in OpenRefine?\"\n"
"- \"What is the difference between Rows and Records in OpenRefine?\"\n"
"- \"How do I work with single cells that contain multiple values in a list?\"\n"
"objectives:\n"
"- Locate controls for navigating data in OpenRefine\n"
"- Find options to work with data through the OpenRefine dropdown menus\n"
"- Split cells which contain multiple bits of data so that each piece of data is in its own cell\n"
"keypoints:\n"
"- \"OpenRefine uses rows and columns to display data\"\n"
"- \"Most options to work with data in OpenRefine are accessed through a drop down menu at the top of a data column\"\n"
"- \"When you select an option in a particular column (e.g. to make a change to the data), it will affect all the cells in that column\"\n"
"- \"OpenRefine has a Records mode which links together multiple rows into a single record\"\n"
"- \"Split and join multi-valued cells to modify the individual values within them\"\n"
"- \"When creating multi-valued cells in your data, choose a separator that will not appear in the data values\"\n"
"---"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:23
# header
msgid "## The layout of OpenRefine"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:24
msgid "OpenRefine displays data in a tabular format. Each row will usually represent a 'record' in the data, while each column represents a type of information. This is very similar to how you might view data in a spreadsheet or database. As with a spreadsheet, the individual bits of data live in 'cells' at the intersection of a row and a column."
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:26
msgid "OpenRefine only displays a limited number of rows of data at one time. You can adjust the number choosing between 5, 10 (the default), 25 and 50 at the top left of the table of data. You can navigate through the records by using the previous/next/first/last navigation options at the top right of the table of data."
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:28
# header
msgid "## Working with data in OpenRefine"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:29
msgid "Most options to work with data in OpenRefine are accessed from drop down menus at the top of the data columns. When you select an option in a particular column (e.g. to make a change to the data), it will affect all the cells in that column. If you want to make changes across several columns, you will need to do this one column at a time."
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:31
# header
msgid "## Rows and Records"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:32
msgid "OpenRefine has two modes of viewing data: 'Rows' and 'Records'. At the moment we are in Rows mode, where each row represents a single record in the data set - in this case, an article. In Records mode, OpenRefine can link together multiple rows as belonging to the same Record."
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:34
# header
msgid "### Splitting Cells"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:36
msgid "To see how this works in practice we can split author names into separate cells. If you look at the Author column you should be able to see that there are multiple names in each cell separated by the pipe symbol ( \\| )."
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:38
msgid "To work with the author names effectively in OpenRefine, we need to have each name in an individual cell. To split the names into their own cells, we can use a `Split multi-valued cells` function:"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:40
#: lc-open-refine/_episodes/03-working-with-data.md:62
# unordered list
msgid "* Click the dropdown menu at the top of the Author column"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:41
# unordered list
msgid "* Choose `Edit cells->Split multi-valued cells`"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:42
# unordered list
msgid "* In the prompt type the ( \\| ) symbol and click `OK`"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:43
# unordered list
msgid " * Note that the rows are still numbered sequentially"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:44
# unordered list
msgid "* Click the `Records` option to change to Records mode"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:45
# unordered list
msgid " * Note how the numbering has changed - indicating that several rows are related to the same record"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:47
msgid " ![rows](../assets/img/rows.png)\n"
" ![records](../assets/img/records.png)"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:50
msgid "Now that we can split multi-valued cells, we'll cover how to join them back together."
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:52
# header
msgid "### Joining Cells"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:54
msgid "A common workflow with multi-valued cells is"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:56
# unordered list
msgid "- split multi-valued cells into individual cells (what we did above)"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:57
# unordered list
msgid "- modify/refine/clean individual cells"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:58
# unordered list
msgid "- join multi-valued cells back together"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:60
msgid "Modifying cells will be covered in future lessons, but for now we will cover how to join cells back together that have been split previously."
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:63
# unordered list
msgid "* Choose `Edit cells->Join multi-valued cells`"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:64
# unordered list
msgid "* In the prompt type the ( \\| ) symbol"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:65
# unordered list
msgid " * Here we are specifying the *delimiter* character for OpenRefine to use to join the values together."
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:66
# unordered list
msgid "* Click `OK` to join the Authors cells back together"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:68
msgid "You will now see that split rows have gone away - the Authors have been joined into a single cell with the specified delimiter. Our Rows and\n"
"Records values will now be the same since we do not have any more columns with split (multi-valued) cells."
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:71
# unordered list
msgid "* Click both the `Rows` and `Records` options and observe how the numbers of Rows and Records are equal"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:73
# header
msgid "### Choosing a good separator"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:75
msgid "The value that separates multi-valued cells is called a separator or delimiter. Choosing a good\n"
"separator is important. In the examples, we've seen the pipe character ( \\| ) has been used."
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:78
msgid "Choosing the wrong separator can lead to problems. Consider the following multi-valued Author example.\n"
"with a pipe as a separator.\n"
"```\n"
"Jones, Andrew | Davis, S.\n"
"```"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:84
msgid "When we tell OpenRefine to split this cell on the pipe ( \\| ), we will get the following two authors each in their own cell since there is a single pipe character separating them."
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:86
# unordered list
msgid "- **Author 1:** Jones, Andrew"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:87
# unordered list
msgid "- **Author 2:** Davis, S."
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:89
msgid "Now imagine that the document creator had chosen a **comma** as the separator instead of a pipe."
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:91
# code block
msgid "```\n"
"Jones, Andrew , Davis, S.\n"
"```"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:95
msgid "Can you spot the problem? Can you tell where one author stops and the next begins? "
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:97
msgid "OpenRefine will split on **every** comma it encounters,\n"
"so we'll end up with 4 authors, not two, because OpenRefine cannot tell that **Jones, Andrew** is supposed to be a single author. We will get\n"
"the following four \"authors\" because there are 3 commas separating the name parts."
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:101
# unordered list
msgid "- **Author 1:** Jones"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:102
# unordered list
msgid "- **Author 2:** Andrew"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:103
# unordered list
msgid "- **Author 3:** Davis"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:104
# unordered list
msgid "- **Author 4:** S."
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:106
msgid "Splitting on a comma will not work with Authors because the names may include commas within them."
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:108
# blockquote, which can be cascaded
msgid "> ## Choose a separator that is not in your data values"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:109
msgid ">\n"
"> When creating a spreadsheet with multi-valued cells, it is important to choose a separator that will never appear in\n"
"> the cell values themselves. For this reason, the pipe character ( \\| ) is often a good choice since it\n"
"> is rarely used in data. Commas, colons and semi-colons should be avoided as separators."
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:115
msgid ">## Splitting Subjects into separate cells\n"
">\n"
">1. What separator character is used in the Subjects cells?\n"
">2. How would you split these subject words into individual cells?\n"
">\n"
"> > ## Solution\n"
"> > 1. The subject words/headings are divided up with the pipe ( \\| ) character\n"
"> > 2. To split the subject words into individual cells you need to:\n"
"> > * Click the dropdown menu at the top of the Subjects column\n"
"> > * Choose 'Edit cells->Split multi-valued cells'\n"
"> > * In the prompt type the ( \\| ) symbol and click 'OK'"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:126
#: lc-open-refine/_episodes/03-working-with-data.md:139
# SC/DC Template label
msgid "> {: .solution}"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:127
#: lc-open-refine/_episodes/03-working-with-data.md:140
#: lc-open-refine/_episodes/04-faceting-and-filtering.md:56
#: lc-open-refine/_episodes/04-faceting-and-filtering.md:99
#: lc-open-refine/_episodes/04-faceting-and-filtering.md:120
#: lc-open-refine/_episodes/05-clustering.md:37
#: lc-open-refine/_episodes/13-looking-up-data.md:80
#: lc-open-refine/_episodes/13-looking-up-data.md:149
# SC/DC Template label
msgid "{: .challenge}"
msgstr ""
#: lc-open-refine/_episodes/03-working-with-data.md:129
msgid ">## Joining the Subjects column back together\n"
">\n"
">1. Using what we've learned, now Join the Subjects back together\n"
">\n"
"> > ## Solution\n"
"> > 1. The subject words/headings were previously delimited with the pipe ( \\| ) character\n"
"> > 2. To join the split subject cells back to a single cell you need to:\n"
"> > * Click the dropdown menu at the top of the Subjects column\n"
"> > * Choose 'Edit cells->Join multi-valued cells'\n"
"> > * In the prompt type the ( \\| ) symbol and click 'OK'"
msgstr ""

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant