{ "cells": [ { "attachments": {}, "cell_type": "markdown", "metadata": {}, "source": [ "[](https://colab.research.google.com/github/LAION-AI/Open-Assistant/blob/main/notebooks/data-argumentation/StackExchangeBuilder.ipynb)" ] }, { "cell_type": "markdown", "metadata": { "id": "TB7CEfs8F-8u" }, "source": [ "# Ingest StackExchange data dumps\n", "This notebook takes a StackExchange Data dump \"Posts.xml\" file and ingests it into a Pandas Dataframe. Outputs of the file can be JSON, JSONL, Parquet, or CSV. " ] }, { "cell_type": "code", "execution_count": null, "metadata": { "id": "0rHryQttyzyY" }, "outputs": [], "source": [ "from bs4 import BeautifulSoup as bs\n", "import pandas as pd\n", "import requests\n", "import json" ] }, { "cell_type": "markdown", "metadata": { "id": "15mAL7GnzBv0" }, "source": [ "# Extract StackExchange\n", "Pull StackExchange file dumps. Specific column types are enforced to prevent errors on processing later in the notebook" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/" }, "id": "FtcvUEaHVxcW", "outputId": "5b0cb19d-e3d9-422b-9077-52241bd09e0e" }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "dict_keys(['3dprinting_meta', '3dprinting', 'Stackoverflow_com_Posts_7z', 'academia_meta', 'academia', 'ai_meta', 'ai', 'android_meta', 'android', 'anime_meta', 'anime', 'apple_meta', 'apple', 'arduino_meta', 'arduino', 'askubuntu_com_7z', 'astronomy_meta', 'astronomy', 'aviation_meta', 'aviation', 'avp_meta', 'avp', 'beer_meta', 'beer', 'bicycles_meta', 'bicycles', 'bioacoustics_meta', 'bioacoustics', 'bioinformatics_meta', 'bioinformatics', 'biology_meta', 'biology', 'bitcoin_meta', 'bitcoin', 'blender_meta', 'blender', 'boardgames_meta', 'boardgames', 'bricks_meta', 'bricks', 'buddhism_meta', 'buddhism', 'cardano_meta', 'cardano', 'chemistry_meta', 'chemistry', 'chess_meta', 'chess', 'chinese_meta', 'chinese', 'christianity_meta', 'christianity', 'civicrm_meta', 'civicrm', 'codegolf_meta', 'codegolf', 'codereview_meta', 'codereview', 'coffee_meta', 'coffee', 'cogsci_meta', 'cogsci', 'computergraphics_meta', 'computergraphics', 'conlang_meta', 'conlang', 'cooking_meta', 'cooking', 'craftcms_meta', 'craftcms', 'crafts_meta', 'crafts', 'crypto_meta', 'crypto', 'cs_meta', 'cs', 'cseducators_meta', 'cseducators', 'cstheory_meta', 'cstheory', 'datascience_meta', 'datascience', 'dba_meta', 'dba', 'devops_meta', 'devops', 'diy_meta', 'diy', 'drones_meta', 'drones', 'drupal_meta', 'drupal', 'dsp_meta', 'dsp', 'earthscience_meta', 'earthscience', 'ebooks_meta', 'ebooks', 'economics_meta', 'economics', 'electronics_meta', 'electronics', 'elementaryos_meta', 'elementaryos', 'ell_meta', 'ell', 'emacs_meta', 'emacs', 'engineering_meta', 'engineering', 'english_meta', 'english', 'eosio_meta', 'eosio', 'es_meta_stackoverflow_com_7z', 'es_stackoverflow_com_7z', 'esperanto_meta', 'esperanto', 'ethereum_meta', 'ethereum', 'expatriates_meta', 'expatriates', 'expressionengine_meta', 'expressionengine', 'fitness_meta', 'fitness', 'freelancing_meta', 'freelancing', 'french_meta', 'french', 'gamedev_meta', 'gamedev', 'gaming_meta', 'gaming', 'gardening_meta', 'gardening', 'genealogy_meta', 'genealogy', 'german_meta', 'german', 'gis_meta', 'gis', 'graphicdesign_meta', 'graphicdesign', 'ham_meta', 'ham', 'hardwarerecs_meta', 'hardwarerecs', 'health_meta', 'health', 'hermeneutics_meta', 'hermeneutics', 'hinduism_meta', 'hinduism', 'history_meta', 'history', 'homebrew_meta', 'homebrew', 'hsm_meta', 'hsm', 'interpersonal_meta', 'interpersonal', 'iot_meta', 'iot', 'iota_meta', 'iota', 'islam_meta', 'islam', 'italian_meta', 'italian', 'ja_meta_stackoverflow_com_7z', 'ja_stackoverflow_com_7z', 'japanese_meta', 'japanese', 'joomla_meta', 'joomla', 'judaism_meta', 'judaism', 'korean_meta', 'korean', 'languagelearning_meta', 'languagelearning', 'latin_meta', 'latin', 'law_meta', 'law', 'lifehacks_meta', 'lifehacks', 'linguistics_meta', 'linguistics', 'literature_meta', 'literature', 'magento_meta', 'magento', 'martialarts_meta', 'martialarts', 'materials_meta', 'materials', 'math_meta', 'math', 'matheducators_meta', 'matheducators', 'mathematica_meta', 'mathematica', 'mathoverflow_net_7z', 'mechanics_meta', 'mechanics', 'meta_askubuntu_com_7z', 'meta_mathoverflow_net_7z', 'meta_serverfault_com_7z', 'meta', 'meta_stackoverflow_com_7z', 'meta_superuser_com_7z', 'moderators_meta', 'moderators', 'monero_meta', 'monero', 'money_meta', 'money', 'movies_meta', 'movies', 'music_meta', 'music', 'musicfans_meta', 'musicfans', 'mythology_meta', 'mythology', 'networkengineering_meta', 'networkengineering', 'opendata_meta', 'opendata', 'opensource_meta', 'opensource', 'or_meta', 'or', 'outdoors_meta', 'outdoors', 'parenting_meta', 'parenting', 'patents_meta', 'patents', 'pets_meta', 'pets', 'philosophy_meta', 'philosophy', 'photo_meta', 'photo', 'physics_meta', 'physics', 'pm_meta', 'pm', 'poker_meta', 'poker', 'politics_meta', 'politics', 'portuguese_meta', 'portuguese', 'proofassistants_meta', 'proofassistants', 'pt_meta_stackoverflow_com_7z', 'pt_stackoverflow_com_7z', 'puzzling_meta', 'puzzling', 'quant_meta', 'quant', 'quantumcomputing_meta', 'quantumcomputing', 'raspberrypi_meta', 'raspberrypi', 'retrocomputing_meta', 'retrocomputing', 'reverseengineering_meta', 'reverseengineering', 'robotics_meta', 'robotics', 'rpg_meta', 'rpg', 'ru_meta_stackoverflow_com_7z', 'ru_stackoverflow_com_7z', 'rus_meta', 'rus', 'russian_meta', 'russian', 'salesforce_meta', 'salesforce', 'scicomp_meta', 'scicomp', 'scifi_meta', 'scifi', 'security_meta', 'security', 'serverfault_com_7z', 'sharepoint_meta', 'sharepoint', 'sitecore_meta', 'sitecore', 'skeptics_meta', 'skeptics', 'softwareengineering_meta', 'softwareengineering', 'softwarerecs_meta', 'softwarerecs', 'solana_meta', 'solana', 'sound_meta', 'sound', 'space_meta', 'space', 'spanish_meta', 'spanish', 'sports_meta', 'sports', 'sqa_meta', 'sqa', 'stackapps_com_7z', 'stackoverflow_com_Badges_7z', 'stackoverflow_com_Comments_7z', 'stackoverflow_com_PostHistory_7z', 'stackoverflow_com_PostLinks_7z', 'stackoverflow_com_Tags_7z', 'stackoverflow_com_Users_7z', 'stackoverflow_com_Votes_7z', 'stats_meta', 'stats', 'stellar_meta', 'stellar', 'substrate_meta', 'substrate', 'superuser_com_7z', 'sustainability_meta', 'sustainability', 'tex_meta', 'tex', 'tezos_meta', 'tezos', 'tor_meta', 'tor', 'travel_meta', 'travel', 'tridion_meta', 'tridion', 'ukrainian_meta', 'ukrainian', 'unix_meta', 'unix', 'ux_meta', 'ux', 'vegetarianism_meta', 'vegetarianism', 'vi_meta', 'vi', 'webapps_meta', 'webapps', 'webmasters_meta', 'webmasters', 'windowsphone_meta', 'windowsphone', 'woodworking_meta', 'woodworking', 'wordpress_meta', 'wordpress', 'workplace_meta', 'workplace', 'worldbuilding_meta', 'worldbuilding', 'writers_meta', 'writers'])\n", "https://ia600107.us.archive.org/view_archive.php?archive=/27/items/stackexchange/ai.stackexchange.com.7z&file=Posts.xml\n" ] } ], "source": [ "base_url = \"https://ia600107.us.archive.org/view_archive.php?archive=/27/items/stackexchange/{0}&file=Posts.xml\"\n", "\n", "\n", "def get_all_filenames():\n", " response = requests.get(\"https://archive.org/download/stackexchange\")\n", " if response.ok:\n", " soup = bs(response.content, \"html.parser\")\n", " table = soup.find(\"table\")\n", " link_tags = table.find_all(\"a\")\n", " urls = {}\n", " for link in link_tags:\n", " url = link[\"href\"]\n", " name = url.split(\".stackexchange\")[0].replace(\".\", \"_\").replace(\"-\", \"_\")\n", " if url.endswith(\"7z\"):\n", " urls[name] = base_url.format(url)\n", " return urls\n", "\n", "\n", "urls = get_all_filenames()\n", "\n", "print(urls.keys())\n", "print(urls.get(\"ai\"))" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 907 }, "id": "-t27RnxdzBYB", "outputId": "5ec0ceed-c82b-48fa-facd-41b4aae2f9e6" }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Id int64\n", "PostTypeId int64\n", "AcceptedAnswerId int64\n", "CreationDate object\n", "Score int64\n", "ViewCount int64\n", "Body object\n", "OwnerUserId object\n", "LastEditorUserId object\n", "LastEditDate object\n", "LastActivityDate object\n", "Title object\n", "Tags object\n", "AnswerCount int64\n", "CommentCount int64\n", "ContentLicense object\n", "ParentId int64\n", "ClosedDate object\n", "FavoriteCount object\n", "CommunityOwnedDate object\n", "LastEditorDisplayName object\n", "OwnerDisplayName object\n", "DataSource object\n", "dtype: object\n" ] }, { "data": { "text/html": [ "\n", "
| \n", " | Id | \n", "PostTypeId | \n", "AcceptedAnswerId | \n", "CreationDate | \n", "Score | \n", "ViewCount | \n", "Body | \n", "OwnerUserId | \n", "LastEditorUserId | \n", "LastEditDate | \n", "... | \n", "AnswerCount | \n", "CommentCount | \n", "ContentLicense | \n", "ParentId | \n", "ClosedDate | \n", "FavoriteCount | \n", "CommunityOwnedDate | \n", "LastEditorDisplayName | \n", "OwnerDisplayName | \n", "DataSource | \n", "
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | \n", "1 | \n", "1 | \n", "3 | \n", "2016-08-02T15:39:14.947 | \n", "10 | \n", "710 | \n", "<p>What does \"backprop\" mean? Is the \"backprop... | \n", "8 | \n", "2444 | \n", "2019-11-16T17:56:22.093 | \n", "... | \n", "5 | \n", "0 | \n", "CC BY-SA 4.0 | \n", "0 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "https://ia600107.us.archive.org/view_archive.p... | \n", "
| 1 | \n", "2 | \n", "1 | \n", "9 | \n", "2016-08-02T15:40:20.623 | \n", "14 | \n", "1008 | \n", "<p>Does increasing the noise in data help to i... | \n", "8 | \n", "2444 | \n", "2019-02-23T22:36:19.090 | \n", "... | \n", "3 | \n", "0 | \n", "CC BY-SA 4.0 | \n", "0 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "https://ia600107.us.archive.org/view_archive.p... | \n", "
| 2 | \n", "3 | \n", "2 | \n", "0 | \n", "2016-08-02T15:40:24.820 | \n", "15 | \n", "0 | \n", "<p>\"Backprop\" is the same as \"backpropagation\"... | \n", "4 | \n", "NaN | \n", "NaN | \n", "... | \n", "0 | \n", "0 | \n", "CC BY-SA 3.0 | \n", "1 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "https://ia600107.us.archive.org/view_archive.p... | \n", "
| 3 | \n", "4 | \n", "1 | \n", "12 | \n", "2016-08-02T15:41:22.020 | \n", "33 | \n", "1266 | \n", "<p>When you're writing your algorithm, how do ... | \n", "8 | \n", "2444 | \n", "2021-01-19T23:54:07.813 | \n", "... | \n", "4 | \n", "0 | \n", "CC BY-SA 3.0 | \n", "0 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "https://ia600107.us.archive.org/view_archive.p... | \n", "
| 4 | \n", "6 | \n", "1 | \n", "20 | \n", "2016-08-02T15:43:35.460 | \n", "7 | \n", "279 | \n", "<p>Given the following definition of an intell... | \n", "29 | \n", "2444 | \n", "2019-06-15T18:25:58.513 | \n", "... | \n", "2 | \n", "0 | \n", "CC BY-SA 4.0 | \n", "0 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "https://ia600107.us.archive.org/view_archive.p... | \n", "
5 rows × 23 columns
\n", "What does \"backprop\" mean? Is the \"backprop... 8 \n", "1 1008
Does increasing the noise in data help to i... 8 \n", "2 0
\"Backprop\" is the same as \"backpropagation\"... 4 \n", "3 1266
When you're writing your algorithm, how do ... 8 \n", "4 279
Given the following definition of an intell... 29 \n", "\n", " LastEditorUserId LastEditDate ... AnswerCount CommentCount \\\n", "0 2444 2019-11-16T17:56:22.093 ... 5 0 \n", "1 2444 2019-02-23T22:36:19.090 ... 3 0 \n", "2 NaN NaN ... 0 0 \n", "3 2444 2021-01-19T23:54:07.813 ... 4 0 \n", "4 2444 2019-06-15T18:25:58.513 ... 2 0 \n", "\n", " ContentLicense ParentId ClosedDate FavoriteCount CommunityOwnedDate \\\n", "0 CC BY-SA 4.0 0 NaN NaN NaN \n", "1 CC BY-SA 4.0 0 NaN NaN NaN \n", "2 CC BY-SA 3.0 1 NaN NaN NaN \n", "3 CC BY-SA 3.0 0 NaN NaN NaN \n", "4 CC BY-SA 4.0 0 NaN NaN NaN \n", "\n", " LastEditorDisplayName OwnerDisplayName \\\n", "0 NaN NaN \n", "1 NaN NaN \n", "2 NaN NaN \n", "3 NaN NaN \n", "4 NaN NaN \n", "\n", " DataSource \n", "0 https://ia600107.us.archive.org/view_archive.p... \n", "1 https://ia600107.us.archive.org/view_archive.p... \n", "2 https://ia600107.us.archive.org/view_archive.p... \n", "3 https://ia600107.us.archive.org/view_archive.p... \n", "4 https://ia600107.us.archive.org/view_archive.p... \n", "\n", "[5 rows x 23 columns]" ] }, "execution_count": 219, "metadata": {}, "output_type": "execute_result" } ], "source": [ "xml_format_map = {\n", " \"Id\": int,\n", " \"PostTypeId\": int,\n", " \"CreationDate\": str,\n", " \"Score\": int,\n", " \"ViewCount\": int,\n", " \"Body\": str,\n", " \"AnswerCount\": int,\n", " \"CommentCount\": int,\n", " \"ContentLicense\": str,\n", " \"AcceptedAnswerId\": int,\n", " \"ParentId\": int,\n", "}\n", "\n", "\n", "# def extract_xml_file(file_url: str):\n", "# table = pd.read_xml(file_url)\n", "# return table\n", "\n", "\n", "def xml_to_df(response: str):\n", " \"\"\"\n", " Collect and Manually import XML into Dataframe\n", "\n", " pd.read_xml() errors when XML trees are too large, this is just a hack to\n", " download a XML file and parse into a Dataframe. **Not Tested on huge XML files**\n", "\n", " Parameters:\n", " response (Requests.Response): Requests response object with the XML data\n", "\n", " Returns:\n", " df (DataFrame): A Dataframe from the XML file\n", " \"\"\"\n", " soup = bs(response.content, \"xml\")\n", " posts = soup.find_all(\"row\")\n", "\n", " all_posts = [post.attrs for post in posts]\n", "\n", " df = pd.DataFrame(all_posts)\n", " df.AnswerCount.fillna(0, inplace=True)\n", " df.ViewCount.fillna(0, inplace=True)\n", " df.AcceptedAnswerId.fillna(0, inplace=True)\n", " df.ParentId.fillna(0, inplace=True)\n", " df[\"DataSource\"] = response.url\n", " df = df.astype(xml_format_map)\n", " return df\n", "\n", "\n", "dataset_name = \"ai\"\n", "\n", "xml_posts_path = urls.get(dataset_name)\n", "\n", "\n", "# df = extract_xml_file(test)\n", "response = requests.get(xml_posts_path)\n", "df = xml_to_df(response)\n", "\n", "\n", "print(df.dtypes)\n", "df.head()" ] }, { "cell_type": "markdown", "metadata": { "id": "RAzTR7zY3oan" }, "source": [ "# Transformations" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 424 }, "id": "qyUqc31Z3Z9g", "outputId": "18dce8b4-af26-49c9-ee73-6c677177b516" }, "outputs": [ { "data": { "text/html": [ "\n", "
| \n", " | Body | \n", "BodyClean | \n", "Tags | \n", "TagsClean | \n", "
|---|---|---|---|---|
| 0 | \n", "<p>What does \"backprop\" mean? Is the \"backprop... | \n", "What does \"backprop\" mean? Is the \"backprop\" t... | \n", "<neural-networks><backpropagation><terminology... | \n", "neural networks, backpropagation, terminology,... | \n", "
| 1 | \n", "<p>Does increasing the noise in data help to i... | \n", "Does increasing the noise in data help to impr... | \n", "<neural-networks><machine-learning><statistica... | \n", "neural networks, machine learning, statistical... | \n", "
| 2 | \n", "<p>\"Backprop\" is the same as \"backpropagation\"... | \n", "\"Backprop\" is the same as \"backpropagation\": i... | \n", "NaN | \n", "NaN | \n", "
| 3 | \n", "<p>When you're writing your algorithm, how do ... | \n", "When you're writing your algorithm, how do you... | \n", "<neural-networks><hyperparameter-optimization>... | \n", "neural networks, hyperparameter optimization, ... | \n", "
| 4 | \n", "<p>Given the following definition of an intell... | \n", "Given the following definition of an intellige... | \n", "<philosophy><definitions><intelligent-agent> | \n", "philosophy, definitions, intelligent agent | \n", "
| ... | \n", "... | \n", "... | \n", "... | \n", "... | \n", "
| 23174 | \n", "<p>The purpose of evaluating the state and act... | \n", "The purpose of evaluating the state and action... | \n", "NaN | \n", "NaN | \n", "
| 23175 | \n", "<p>In machine translation, convolution is a te... | \n", "In machine translation, convolution is a techn... | \n", "NaN | \n", "NaN | \n", "
| 23176 | \n", "<p>One of the key features of ChatGPT is its a... | \n", "One of the key features of ChatGPT is its abil... | \n", "NaN | \n", "NaN | \n", "
| 23177 | \n", "<p>Given a neural network model for Covid-19 c... | \n", "Given a neural network model for Covid-19 clas... | \n", "<neural-networks><homework> | \n", "neural networks, homework | \n", "
| 23178 | \n", "<p>My question is more related to the fundamen... | \n", "My question is more related to the fundamental... | \n", "<search><constraint-satisfaction-problems> | \n", "search, constraint satisfaction problems | \n", "
23179 rows × 4 columns
\n", "What does \"backprop\" mean? Is the \"backprop... \n", "1
Does increasing the noise in data help to i... \n", "2
\"Backprop\" is the same as \"backpropagation\"... \n", "3
When you're writing your algorithm, how do ... \n", "4
Given the following definition of an intell... \n", "... ... \n", "23174
The purpose of evaluating the state and act... \n", "23175
In machine translation, convolution is a te... \n", "23176
One of the key features of ChatGPT is its a... \n", "23177
Given a neural network model for Covid-19 c... \n", "23178
My question is more related to the fundamen... \n",
"\n",
" BodyClean \\\n",
"0 What does \"backprop\" mean? Is the \"backprop\" t... \n",
"1 Does increasing the noise in data help to impr... \n",
"2 \"Backprop\" is the same as \"backpropagation\": i... \n",
"3 When you're writing your algorithm, how do you... \n",
"4 Given the following definition of an intellige... \n",
"... ... \n",
"23174 The purpose of evaluating the state and action... \n",
"23175 In machine translation, convolution is a techn... \n",
"23176 One of the key features of ChatGPT is its abil... \n",
"23177 Given a neural network model for Covid-19 clas... \n",
"23178 My question is more related to the fundamental... \n",
"\n",
" Tags \\\n",
"0 \n",
" \n",
"
\n",
"\n",
" \n",
" \n",
" \n",
" \n",
" Id_q \n",
" Question \n",
" QuestionScore \n",
" QuestionTags \n",
" Id_a \n",
" Answer \n",
" AnswerScore \n",
" AcceptedAnswerFlag \n",
" \n",
" \n",
" 0 \n",
" 1768 \n",
" In Portal 2 we see that AI's can be \"killed\" b... \n",
" 175 \n",
" philosophy, decision theory, mythology of ai, ... \n",
" 1769.0 \n",
" This classic problem exhibits a basic misunder... \n",
" 146.0 \n",
" True \n",
" \n",
" \n",
" 1 \n",
" 10623 \n",
" What is self-supervised learning in machine le... \n",
" 91 \n",
" machine learning, comparison, supervised learn... \n",
" 10624.0 \n",
" Introduction\\nThe term self-supervised learnin... \n",
" 90.0 \n",
" True \n",
" \n",
" \n",
" 2 \n",
" 111 \n",
" Obviously, self-driving cars aren't perfect, s... \n",
" 100 \n",
" philosophy, ethics, autonomous vehicles, decis... \n",
" 1790.0 \n",
" \\nHow could self-driving cars make ethical dec... \n",
" 76.0 \n",
" True \n",
" \n",
" \n",
" 3 \n",
" 14224 \n",
" If the original purpose for developing AI was ... \n",
" 69 \n",
" philosophy, social, explainable ai \n",
" 14247.0 \n",
" As argued by Selvaraju et al., there are three... \n",
" 75.0 \n",
" True \n",
" \n",
" \n",
" \n",
"4 \n",
" 1479 \n",
" Do scientists or research experts know from th... \n",
" 94 \n",
" neural networks, deep learning, convolutional ... \n",
" 4044.0 \n",
" There are many approaches that aim to make a t... \n",
" 69.0 \n",
" True \n",
" \n",
" \n",
"
\n",
"\n",
" \n",
" \n",
" \n",
" \n",
" Id_q \n",
" Question \n",
" ParentId_a \n",
" AcceptedAnswerId \n",
" Id_a \n",
" Answer \n",
" AnswerScore \n",
" AcceptedAnswerFlag \n",
" \n",
" \n",
" 7 \n",
" 15730 \n",
" As a human being, we can think infinity. In pr... \n",
" 15730.0 \n",
" 15744 \n",
" 15744.0 \n",
" I think this is a fairly common misconception ... \n",
" 62.0 \n",
" True \n",
" \n",
" \n",
" 3662 \n",
" 15730 \n",
" As a human being, we can think infinity. In pr... \n",
" 15730.0 \n",
" 15744 \n",
" 15753.0 \n",
" I think your premise is flawed.\\nYou seem to a... \n",
" 19.0 \n",
" False \n",
" \n",
" \n",
" 3713 \n",
" 15730 \n",
" As a human being, we can think infinity. In pr... \n",
" 15730.0 \n",
" 15744 \n",
" 15747.0 \n",
" TL;DR: The subtleties of infinity are made app... \n",
" 12.0 \n",
" False \n",
" \n",
" \n",
" 3788 \n",
" 15730 \n",
" As a human being, we can think infinity. In pr... \n",
" 15730.0 \n",
" 15744 \n",
" 15756.0 \n",
" In Haskell, you can type:\\nprint [1..]\\nand it... \n",
" 9.0 \n",
" False \n",
" \n",
" \n",
" 3821 \n",
" 15730 \n",
" As a human being, we can think infinity. In pr... \n",
" 15730.0 \n",
" 15744 \n",
" 15758.0 \n",
" I believe humans can be said to understand inf... \n",
" 8.0 \n",
" False \n",
" \n",
" \n",
" 3882 \n",
" 15730 \n",
" As a human being, we can think infinity. In pr... \n",
" 15730.0 \n",
" 15744 \n",
" 15762.0 \n",
" (There's a summary at the bottom for those who... \n",
" 7.0 \n",
" False \n",
" \n",
" \n",
" 4389 \n",
" 15730 \n",
" As a human being, we can think infinity. In pr... \n",
" 15730.0 \n",
" 15744 \n",
" 15783.0 \n",
" Then premise assumes that humans \"understand\" ... \n",
" 4.0 \n",
" False \n",
" \n",
" \n",
" 4849 \n",
" 15730 \n",
" As a human being, we can think infinity. In pr... \n",
" 15730.0 \n",
" 15744 \n",
" 15740.0 \n",
" By adding some rules for infinity in arithmeti... \n",
" 3.0 \n",
" False \n",
" \n",
" \n",
" 4850 \n",
" 15730 \n",
" As a human being, we can think infinity. In pr... \n",
" 15730.0 \n",
" 15744 \n",
" 15803.0 \n",
" I think the concept that is missing in the dis... \n",
" 3.0 \n",
" False \n",
" \n",
" \n",
" 5763 \n",
" 15730 \n",
" As a human being, we can think infinity. In pr... \n",
" 15730.0 \n",
" 15744 \n",
" 15768.0 \n",
" Computers don't understand \"infinity\" or even ... \n",
" 2.0 \n",
" False \n",
" \n",
" \n",
" 5764 \n",
" 15730 \n",
" As a human being, we can think infinity. In pr... \n",
" 15730.0 \n",
" 15744 \n",
" 15810.0 \n",
" The Questions That Computers Can Never Answer ... \n",
" 2.0 \n",
" False \n",
" \n",
" \n",
" 5765 \n",
" 15730 \n",
" As a human being, we can think infinity. In pr... \n",
" 15730.0 \n",
" 15744 \n",
" 15943.0 \n",
" John Doucette's answer covers my thoughts on t... \n",
" 2.0 \n",
" False \n",
" \n",
" \n",
" 7462 \n",
" 15730 \n",
" As a human being, we can think infinity. In pr... \n",
" 15730.0 \n",
" 15744 \n",
" 15779.0 \n",
" I would think that a computer couldn’t underst... \n",
" 1.0 \n",
" False \n",
" \n",
" \n",
" 7463 \n",
" 15730 \n",
" As a human being, we can think infinity. In pr... \n",
" 15730.0 \n",
" 15744 \n",
" 15787.0 \n",
" The \"concept\" of infinity is 1 thing to unders... \n",
" 1.0 \n",
" False \n",
" \n",
" \n",
" 7464 \n",
" 15730 \n",
" As a human being, we can think infinity. In pr... \n",
" 15730.0 \n",
" 15744 \n",
" 15801.0 \n",
" Just food for thought: how about if we try to ... \n",
" 1.0 \n",
" False \n",
" \n",
" \n",
" 7465 \n",
" 15730 \n",
" As a human being, we can think infinity. In pr... \n",
" 15730.0 \n",
" 15744 \n",
" 15930.0 \n",
" Its arguable if we humans understand infinity.... \n",
" 1.0 \n",
" False \n",
" \n",
" \n",
" 7466 \n",
" 15730 \n",
" As a human being, we can think infinity. In pr... \n",
" 15730.0 \n",
" 15744 \n",
" 15934.0 \n",
" Well -- just to touch on the question of peopl... \n",
" 1.0 \n",
" False \n",
" \n",
" \n",
" 7467 \n",
" 15730 \n",
" As a human being, we can think infinity. In pr... \n",
" 15730.0 \n",
" 15744 \n",
" 15938.0 \n",
" Humans certainly don't understand infinity. Cu... \n",
" 1.0 \n",
" False \n",
" \n",
" \n",
" \n",
"9481 \n",
" 15730 \n",
" As a human being, we can think infinity. In pr... \n",
" 15730.0 \n",
" 15744 \n",
" 15931.0 \n",
" I think the property humans have which compute... \n",
" 0.0 \n",
" False \n",
"