diff --git a/notebooks/data-argumentation/StackExchangeBuilder.ipynb b/notebooks/data-argumentation/StackExchangeBuilder.ipynb index ac215cbe..625d757b 100644 --- a/notebooks/data-argumentation/StackExchangeBuilder.ipynb +++ b/notebooks/data-argumentation/StackExchangeBuilder.ipynb @@ -1,1833 +1,1845 @@ { - "nbformat": 4, - "nbformat_minor": 0, - "metadata": { - "colab": { - "provenance": [] - }, - "kernelspec": { - "name": "python3", - "display_name": "Python 3" - }, - "language_info": { - "name": "python" - } + "nbformat": 4, + "nbformat_minor": 0, + "metadata": { + "colab": { + "provenance": [] }, - "cells": [ - { - "cell_type": "markdown", - "source": [ - "# Ingest StackExchange data dumps\n", - "This notebook takes a StackExchange Data dump \"Posts.xml\" file and ingests it into a Pandas Dataframe. Outputs of the file can be JSON, JSONL, Parquet, or CSV. " - ], - "metadata": { - "id": "TB7CEfs8F-8u" - } + "kernelspec": { + "name": "python3", + "display_name": "Python 3" + }, + "language_info": { + "name": "python" + } + }, + "cells": [ + { + "cell_type": "markdown", + "source": [ + "# Ingest StackExchange data dumps\n", + "This notebook takes a StackExchange Data dump \"Posts.xml\" file and ingests it into a Pandas Dataframe. Outputs of the file can be JSON, JSONL, Parquet, or CSV. " + ], + "metadata": { + "id": "TB7CEfs8F-8u" + } + }, + { + "cell_type": "code", + "execution_count": null, + "metadata": { + "id": "0rHryQttyzyY" + }, + "outputs": [], + "source": [ + "from bs4 import BeautifulSoup as bs\n", + "import pandas as pd\n", + "import requests\n", + "import json" + ] + }, + { + "cell_type": "markdown", + "source": [ + "# Extract StackExchange\n", + "Pull StackExchange file dumps. Specific column types are enforced to prevent errors on processing later in the notebook" + ], + "metadata": { + "id": "15mAL7GnzBv0" + } + }, + { + "cell_type": "code", + "source": [ + "base_url = \"https://ia600107.us.archive.org/view_archive.php?archive=/27/items/stackexchange/{0}&file=Posts.xml\"\n", + "\n", + "\n", + "def get_all_filenames():\n", + " response = requests.get(\"https://archive.org/download/stackexchange\")\n", + " if response.ok:\n", + " soup = bs(response.content, \"html.parser\")\n", + " table = soup.find(\"table\")\n", + " link_tags = table.find_all(\"a\")\n", + " urls = {}\n", + " for link in link_tags:\n", + " url = link[\"href\"]\n", + " name = url.split(\".stackexchange\")[0].replace(\".\", \"_\").replace(\"-\", \"_\")\n", + " if url.endswith(\"7z\"):\n", + " urls[name] = base_url.format(url)\n", + " return urls\n", + "\n", + "\n", + "urls = get_all_filenames()\n", + "\n", + "print(urls.keys())\n", + "print(urls.get(\"ai\"))" + ], + "metadata": { + "colab": { + "base_uri": "https://localhost:8080/" }, + "id": "FtcvUEaHVxcW", + "outputId": "5b0cb19d-e3d9-422b-9077-52241bd09e0e" + }, + "execution_count": null, + "outputs": [ { - "cell_type": "code", - "execution_count": null, - "metadata": { - "id": "0rHryQttyzyY" - }, - "outputs": [], - "source": [ - "from bs4 import BeautifulSoup as bs\n", - "import pandas as pd\n", - "import requests\n", - "import json" - ] - }, - { - "cell_type": "markdown", - "source": [ - "# Extract StackExchange\n", - "Pull StackExchange file dumps. Specific column types are enforced to prevent errors on processing later in the notebook" - ], - "metadata": { - "id": "15mAL7GnzBv0" - } - }, - { - "cell_type": "code", - "source": [ - "\n", - "base_url = \"https://ia600107.us.archive.org/view_archive.php?archive=/27/items/stackexchange/{0}&file=Posts.xml\"\n", - "\n", - "def get_all_filenames():\n", - " response = requests.get(\"https://archive.org/download/stackexchange\")\n", - " if response.ok:\n", - " soup = bs(response.content, 'html.parser')\n", - " table = soup.find('table')\n", - " link_tags = table.find_all('a')\n", - " urls = {}\n", - " for link in link_tags:\n", - " url = link['href']\n", - " name = url.split('.stackexchange')[0].replace('.', '_').replace('-', '_')\n", - " if url.endswith('7z'):\n", - " urls[name] = base_url.format(url)\n", - " return urls\n", - "\n", - "urls = get_all_filenames()\n", - "\n", - "print(urls.keys())\n", - "print(urls.get('ai'))" - ], - "metadata": { - "colab": { - "base_uri": "https://localhost:8080/" - }, - "id": "FtcvUEaHVxcW", - "outputId": "5b0cb19d-e3d9-422b-9077-52241bd09e0e" - }, - "execution_count": null, - "outputs": [ - { - "output_type": "stream", - "name": "stdout", - "text": [ - "dict_keys(['3dprinting_meta', '3dprinting', 'Stackoverflow_com_Posts_7z', 'academia_meta', 'academia', 'ai_meta', 'ai', 'android_meta', 'android', 'anime_meta', 'anime', 'apple_meta', 'apple', 'arduino_meta', 'arduino', 'askubuntu_com_7z', 'astronomy_meta', 'astronomy', 'aviation_meta', 'aviation', 'avp_meta', 'avp', 'beer_meta', 'beer', 'bicycles_meta', 'bicycles', 'bioacoustics_meta', 'bioacoustics', 'bioinformatics_meta', 'bioinformatics', 'biology_meta', 'biology', 'bitcoin_meta', 'bitcoin', 'blender_meta', 'blender', 'boardgames_meta', 'boardgames', 'bricks_meta', 'bricks', 'buddhism_meta', 'buddhism', 'cardano_meta', 'cardano', 'chemistry_meta', 'chemistry', 'chess_meta', 'chess', 'chinese_meta', 'chinese', 'christianity_meta', 'christianity', 'civicrm_meta', 'civicrm', 'codegolf_meta', 'codegolf', 'codereview_meta', 'codereview', 'coffee_meta', 'coffee', 'cogsci_meta', 'cogsci', 'computergraphics_meta', 'computergraphics', 'conlang_meta', 'conlang', 'cooking_meta', 'cooking', 'craftcms_meta', 'craftcms', 'crafts_meta', 'crafts', 'crypto_meta', 'crypto', 'cs_meta', 'cs', 'cseducators_meta', 'cseducators', 'cstheory_meta', 'cstheory', 'datascience_meta', 'datascience', 'dba_meta', 'dba', 'devops_meta', 'devops', 'diy_meta', 'diy', 'drones_meta', 'drones', 'drupal_meta', 'drupal', 'dsp_meta', 'dsp', 'earthscience_meta', 'earthscience', 'ebooks_meta', 'ebooks', 'economics_meta', 'economics', 'electronics_meta', 'electronics', 'elementaryos_meta', 'elementaryos', 'ell_meta', 'ell', 'emacs_meta', 'emacs', 'engineering_meta', 'engineering', 'english_meta', 'english', 'eosio_meta', 'eosio', 'es_meta_stackoverflow_com_7z', 'es_stackoverflow_com_7z', 'esperanto_meta', 'esperanto', 'ethereum_meta', 'ethereum', 'expatriates_meta', 'expatriates', 'expressionengine_meta', 'expressionengine', 'fitness_meta', 'fitness', 'freelancing_meta', 'freelancing', 'french_meta', 'french', 'gamedev_meta', 'gamedev', 'gaming_meta', 'gaming', 'gardening_meta', 'gardening', 'genealogy_meta', 'genealogy', 'german_meta', 'german', 'gis_meta', 'gis', 'graphicdesign_meta', 'graphicdesign', 'ham_meta', 'ham', 'hardwarerecs_meta', 'hardwarerecs', 'health_meta', 'health', 'hermeneutics_meta', 'hermeneutics', 'hinduism_meta', 'hinduism', 'history_meta', 'history', 'homebrew_meta', 'homebrew', 'hsm_meta', 'hsm', 'interpersonal_meta', 'interpersonal', 'iot_meta', 'iot', 'iota_meta', 'iota', 'islam_meta', 'islam', 'italian_meta', 'italian', 'ja_meta_stackoverflow_com_7z', 'ja_stackoverflow_com_7z', 'japanese_meta', 'japanese', 'joomla_meta', 'joomla', 'judaism_meta', 'judaism', 'korean_meta', 'korean', 'languagelearning_meta', 'languagelearning', 'latin_meta', 'latin', 'law_meta', 'law', 'lifehacks_meta', 'lifehacks', 'linguistics_meta', 'linguistics', 'literature_meta', 'literature', 'magento_meta', 'magento', 'martialarts_meta', 'martialarts', 'materials_meta', 'materials', 'math_meta', 'math', 'matheducators_meta', 'matheducators', 'mathematica_meta', 'mathematica', 'mathoverflow_net_7z', 'mechanics_meta', 'mechanics', 'meta_askubuntu_com_7z', 'meta_mathoverflow_net_7z', 'meta_serverfault_com_7z', 'meta', 'meta_stackoverflow_com_7z', 'meta_superuser_com_7z', 'moderators_meta', 'moderators', 'monero_meta', 'monero', 'money_meta', 'money', 'movies_meta', 'movies', 'music_meta', 'music', 'musicfans_meta', 'musicfans', 'mythology_meta', 'mythology', 'networkengineering_meta', 'networkengineering', 'opendata_meta', 'opendata', 'opensource_meta', 'opensource', 'or_meta', 'or', 'outdoors_meta', 'outdoors', 'parenting_meta', 'parenting', 'patents_meta', 'patents', 'pets_meta', 'pets', 'philosophy_meta', 'philosophy', 'photo_meta', 'photo', 'physics_meta', 'physics', 'pm_meta', 'pm', 'poker_meta', 'poker', 'politics_meta', 'politics', 'portuguese_meta', 'portuguese', 'proofassistants_meta', 'proofassistants', 'pt_meta_stackoverflow_com_7z', 'pt_stackoverflow_com_7z', 'puzzling_meta', 'puzzling', 'quant_meta', 'quant', 'quantumcomputing_meta', 'quantumcomputing', 'raspberrypi_meta', 'raspberrypi', 'retrocomputing_meta', 'retrocomputing', 'reverseengineering_meta', 'reverseengineering', 'robotics_meta', 'robotics', 'rpg_meta', 'rpg', 'ru_meta_stackoverflow_com_7z', 'ru_stackoverflow_com_7z', 'rus_meta', 'rus', 'russian_meta', 'russian', 'salesforce_meta', 'salesforce', 'scicomp_meta', 'scicomp', 'scifi_meta', 'scifi', 'security_meta', 'security', 'serverfault_com_7z', 'sharepoint_meta', 'sharepoint', 'sitecore_meta', 'sitecore', 'skeptics_meta', 'skeptics', 'softwareengineering_meta', 'softwareengineering', 'softwarerecs_meta', 'softwarerecs', 'solana_meta', 'solana', 'sound_meta', 'sound', 'space_meta', 'space', 'spanish_meta', 'spanish', 'sports_meta', 'sports', 'sqa_meta', 'sqa', 'stackapps_com_7z', 'stackoverflow_com_Badges_7z', 'stackoverflow_com_Comments_7z', 'stackoverflow_com_PostHistory_7z', 'stackoverflow_com_PostLinks_7z', 'stackoverflow_com_Tags_7z', 'stackoverflow_com_Users_7z', 'stackoverflow_com_Votes_7z', 'stats_meta', 'stats', 'stellar_meta', 'stellar', 'substrate_meta', 'substrate', 'superuser_com_7z', 'sustainability_meta', 'sustainability', 'tex_meta', 'tex', 'tezos_meta', 'tezos', 'tor_meta', 'tor', 'travel_meta', 'travel', 'tridion_meta', 'tridion', 'ukrainian_meta', 'ukrainian', 'unix_meta', 'unix', 'ux_meta', 'ux', 'vegetarianism_meta', 'vegetarianism', 'vi_meta', 'vi', 'webapps_meta', 'webapps', 'webmasters_meta', 'webmasters', 'windowsphone_meta', 'windowsphone', 'woodworking_meta', 'woodworking', 'wordpress_meta', 'wordpress', 'workplace_meta', 'workplace', 'worldbuilding_meta', 'worldbuilding', 'writers_meta', 'writers'])\n", - "https://ia600107.us.archive.org/view_archive.php?archive=/27/items/stackexchange/ai.stackexchange.com.7z&file=Posts.xml\n" - ] - } - ] - }, - { - "cell_type": "code", - "source": [ - "\n", - "xml_format_map = {'Id': int, 'PostTypeId': int, 'CreationDate': str, \n", - " 'Score': int, 'ViewCount': int, 'Body': str, 'AnswerCount': int, \n", - " 'CommentCount': int, 'ContentLicense': str, 'AcceptedAnswerId': int, \"ParentId\": int}\n", - "\n", - "\n", - "# def extract_xml_file(file_url: str):\n", - "# table = pd.read_xml(file_url)\n", - "# return table\n", - "\n", - "def xml_to_df(response: str):\n", - " \"\"\"\n", - " Collect and Manually import XML into Dataframe\n", - "\n", - " pd.read_xml() errors when XML trees are too large, this is just a hack to\n", - " download a XML file and parse into a Dataframe. **Not Tested on huge XML files**\n", - "\n", - " Parameters:\n", - " response (Requests.Response): Requests response object with the XML data\n", - "\n", - " Returns:\n", - " df (DataFrame): A Dataframe from the XML file\n", - " \"\"\" \n", - " soup = bs(response.content, 'xml')\n", - " posts = soup.find_all('row')\n", - "\n", - " all_posts = [post.attrs for post in posts]\n", - "\n", - " df = pd.DataFrame(all_posts)\n", - " df.AnswerCount.fillna(0, inplace=True)\n", - " df.ViewCount.fillna(0, inplace=True)\n", - " df.AcceptedAnswerId.fillna(0, inplace=True)\n", - " df.ParentId.fillna(0, inplace=True)\n", - " df['DataSource'] = response.url\n", - " df = df.astype(xml_format_map)\n", - " return df\n", - "\n", - "\n", - "\n", - "dataset_name = \"ai\"\n", - "\n", - "xml_posts_path = urls.get(dataset_name)\n", - "\n", - "\n", - "# df = extract_xml_file(test)\n", - "response = requests.get(xml_posts_path)\n", - "df = xml_to_df(response)\n", - "\n", - "\n", - "print(df.dtypes)\n", - "df.head()" - ], - "metadata": { - "colab": { - "base_uri": "https://localhost:8080/", - "height": 907 - }, - "id": "-t27RnxdzBYB", - "outputId": "5ec0ceed-c82b-48fa-facd-41b4aae2f9e6" - }, - "execution_count": null, - "outputs": [ - { - "output_type": "stream", - "name": "stdout", - "text": [ - "Id int64\n", - "PostTypeId int64\n", - "AcceptedAnswerId int64\n", - "CreationDate object\n", - "Score int64\n", - "ViewCount int64\n", - "Body object\n", - "OwnerUserId object\n", - "LastEditorUserId object\n", - "LastEditDate object\n", - "LastActivityDate object\n", - "Title object\n", - "Tags object\n", - "AnswerCount int64\n", - "CommentCount int64\n", - "ContentLicense object\n", - "ParentId int64\n", - "ClosedDate object\n", - "FavoriteCount object\n", - "CommunityOwnedDate object\n", - "LastEditorDisplayName object\n", - "OwnerDisplayName object\n", - "DataSource object\n", - "dtype: object\n" - ] - }, - { - "output_type": "execute_result", - "data": { - "text/plain": [ - " Id PostTypeId AcceptedAnswerId CreationDate Score \\\n", - "0 1 1 3 2016-08-02T15:39:14.947 10 \n", - "1 2 1 9 2016-08-02T15:40:20.623 14 \n", - "2 3 2 0 2016-08-02T15:40:24.820 15 \n", - "3 4 1 12 2016-08-02T15:41:22.020 33 \n", - "4 6 1 20 2016-08-02T15:43:35.460 7 \n", - "\n", - " ViewCount Body OwnerUserId \\\n", - "0 710
What does \"backprop\" mean? Is the \"backprop... 8 \n", - "1 1008
Does increasing the noise in data help to i... 8 \n", - "2 0
\"Backprop\" is the same as \"backpropagation\"... 4 \n", - "3 1266
When you're writing your algorithm, how do ... 8 \n", - "4 279
Given the following definition of an intell... 29 \n", - "\n", - " LastEditorUserId LastEditDate ... AnswerCount CommentCount \\\n", - "0 2444 2019-11-16T17:56:22.093 ... 5 0 \n", - "1 2444 2019-02-23T22:36:19.090 ... 3 0 \n", - "2 NaN NaN ... 0 0 \n", - "3 2444 2021-01-19T23:54:07.813 ... 4 0 \n", - "4 2444 2019-06-15T18:25:58.513 ... 2 0 \n", - "\n", - " ContentLicense ParentId ClosedDate FavoriteCount CommunityOwnedDate \\\n", - "0 CC BY-SA 4.0 0 NaN NaN NaN \n", - "1 CC BY-SA 4.0 0 NaN NaN NaN \n", - "2 CC BY-SA 3.0 1 NaN NaN NaN \n", - "3 CC BY-SA 3.0 0 NaN NaN NaN \n", - "4 CC BY-SA 4.0 0 NaN NaN NaN \n", - "\n", - " LastEditorDisplayName OwnerDisplayName \\\n", - "0 NaN NaN \n", - "1 NaN NaN \n", - "2 NaN NaN \n", - "3 NaN NaN \n", - "4 NaN NaN \n", - "\n", - " DataSource \n", - "0 https://ia600107.us.archive.org/view_archive.p... \n", - "1 https://ia600107.us.archive.org/view_archive.p... \n", - "2 https://ia600107.us.archive.org/view_archive.p... \n", - "3 https://ia600107.us.archive.org/view_archive.p... \n", - "4 https://ia600107.us.archive.org/view_archive.p... \n", - "\n", - "[5 rows x 23 columns]" - ], - "text/html": [ - "\n", - "
| \n", - " | Id | \n", - "PostTypeId | \n", - "AcceptedAnswerId | \n", - "CreationDate | \n", - "Score | \n", - "ViewCount | \n", - "Body | \n", - "OwnerUserId | \n", - "LastEditorUserId | \n", - "LastEditDate | \n", - "... | \n", - "AnswerCount | \n", - "CommentCount | \n", - "ContentLicense | \n", - "ParentId | \n", - "ClosedDate | \n", - "FavoriteCount | \n", - "CommunityOwnedDate | \n", - "LastEditorDisplayName | \n", - "OwnerDisplayName | \n", - "DataSource | \n", - "
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | \n", - "1 | \n", - "1 | \n", - "3 | \n", - "2016-08-02T15:39:14.947 | \n", - "10 | \n", - "710 | \n", - "<p>What does \"backprop\" mean? Is the \"backprop... | \n", - "8 | \n", - "2444 | \n", - "2019-11-16T17:56:22.093 | \n", - "... | \n", - "5 | \n", - "0 | \n", - "CC BY-SA 4.0 | \n", - "0 | \n", - "NaN | \n", - "NaN | \n", - "NaN | \n", - "NaN | \n", - "NaN | \n", - "https://ia600107.us.archive.org/view_archive.p... | \n", - "
| 1 | \n", - "2 | \n", - "1 | \n", - "9 | \n", - "2016-08-02T15:40:20.623 | \n", - "14 | \n", - "1008 | \n", - "<p>Does increasing the noise in data help to i... | \n", - "8 | \n", - "2444 | \n", - "2019-02-23T22:36:19.090 | \n", - "... | \n", - "3 | \n", - "0 | \n", - "CC BY-SA 4.0 | \n", - "0 | \n", - "NaN | \n", - "NaN | \n", - "NaN | \n", - "NaN | \n", - "NaN | \n", - "https://ia600107.us.archive.org/view_archive.p... | \n", - "
| 2 | \n", - "3 | \n", - "2 | \n", - "0 | \n", - "2016-08-02T15:40:24.820 | \n", - "15 | \n", - "0 | \n", - "<p>\"Backprop\" is the same as \"backpropagation\"... | \n", - "4 | \n", - "NaN | \n", - "NaN | \n", - "... | \n", - "0 | \n", - "0 | \n", - "CC BY-SA 3.0 | \n", - "1 | \n", - "NaN | \n", - "NaN | \n", - "NaN | \n", - "NaN | \n", - "NaN | \n", - "https://ia600107.us.archive.org/view_archive.p... | \n", - "
| 3 | \n", - "4 | \n", - "1 | \n", - "12 | \n", - "2016-08-02T15:41:22.020 | \n", - "33 | \n", - "1266 | \n", - "<p>When you're writing your algorithm, how do ... | \n", - "8 | \n", - "2444 | \n", - "2021-01-19T23:54:07.813 | \n", - "... | \n", - "4 | \n", - "0 | \n", - "CC BY-SA 3.0 | \n", - "0 | \n", - "NaN | \n", - "NaN | \n", - "NaN | \n", - "NaN | \n", - "NaN | \n", - "https://ia600107.us.archive.org/view_archive.p... | \n", - "
| 4 | \n", - "6 | \n", - "1 | \n", - "20 | \n", - "2016-08-02T15:43:35.460 | \n", - "7 | \n", - "279 | \n", - "<p>Given the following definition of an intell... | \n", - "29 | \n", - "2444 | \n", - "2019-06-15T18:25:58.513 | \n", - "... | \n", - "2 | \n", - "0 | \n", - "CC BY-SA 4.0 | \n", - "0 | \n", - "NaN | \n", - "NaN | \n", - "NaN | \n", - "NaN | \n", - "NaN | \n", - "https://ia600107.us.archive.org/view_archive.p... | \n", - "
5 rows × 23 columns
\n", - "What does \"backprop\" mean? Is the \"backprop... \n", - "1
Does increasing the noise in data help to i... \n", - "2
\"Backprop\" is the same as \"backpropagation\"... \n", - "3
When you're writing your algorithm, how do ... \n", - "4
Given the following definition of an intell... \n", - "... ... \n", - "23174
The purpose of evaluating the state and act... \n", - "23175
In machine translation, convolution is a te... \n", - "23176
One of the key features of ChatGPT is its a... \n", - "23177
Given a neural network model for Covid-19 c... \n", - "23178
My question is more related to the fundamen... \n",
- "\n",
- " BodyClean \\\n",
- "0 What does \"backprop\" mean? Is the \"backprop\" t... \n",
- "1 Does increasing the noise in data help to impr... \n",
- "2 \"Backprop\" is the same as \"backpropagation\": i... \n",
- "3 When you're writing your algorithm, how do you... \n",
- "4 Given the following definition of an intellige... \n",
- "... ... \n",
- "23174 The purpose of evaluating the state and action... \n",
- "23175 In machine translation, convolution is a techn... \n",
- "23176 One of the key features of ChatGPT is its abil... \n",
- "23177 Given a neural network model for Covid-19 clas... \n",
- "23178 My question is more related to the fundamental... \n",
- "\n",
- " Tags \\\n",
- "0 23179 rows × 4 columns What does \"backprop\" mean? Is the \"backprop... 8 \n",
+ "1 1008 Does increasing the noise in data help to i... 8 \n",
+ "2 0 \"Backprop\" is the same as \"backpropagation\"... 4 \n",
+ "3 1266 When you're writing your algorithm, how do ... 8 \n",
+ "4 279 Given the following definition of an intell... 29 \n",
+ "\n",
+ " LastEditorUserId LastEditDate ... AnswerCount CommentCount \\\n",
+ "0 2444 2019-11-16T17:56:22.093 ... 5 0 \n",
+ "1 2444 2019-02-23T22:36:19.090 ... 3 0 \n",
+ "2 NaN NaN ... 0 0 \n",
+ "3 2444 2021-01-19T23:54:07.813 ... 4 0 \n",
+ "4 2444 2019-06-15T18:25:58.513 ... 2 0 \n",
+ "\n",
+ " ContentLicense ParentId ClosedDate FavoriteCount CommunityOwnedDate \\\n",
+ "0 CC BY-SA 4.0 0 NaN NaN NaN \n",
+ "1 CC BY-SA 4.0 0 NaN NaN NaN \n",
+ "2 CC BY-SA 3.0 1 NaN NaN NaN \n",
+ "3 CC BY-SA 3.0 0 NaN NaN NaN \n",
+ "4 CC BY-SA 4.0 0 NaN NaN NaN \n",
+ "\n",
+ " LastEditorDisplayName OwnerDisplayName \\\n",
+ "0 NaN NaN \n",
+ "1 NaN NaN \n",
+ "2 NaN NaN \n",
+ "3 NaN NaN \n",
+ "4 NaN NaN \n",
+ "\n",
+ " DataSource \n",
+ "0 https://ia600107.us.archive.org/view_archive.p... \n",
+ "1 https://ia600107.us.archive.org/view_archive.p... \n",
+ "2 https://ia600107.us.archive.org/view_archive.p... \n",
+ "3 https://ia600107.us.archive.org/view_archive.p... \n",
+ "4 https://ia600107.us.archive.org/view_archive.p... \n",
+ "\n",
+ "[5 rows x 23 columns]"
+ ],
+ "text/html": [
+ "\n",
+ " 5 rows × 23 columns What does \"backprop\" mean? Is the \"backprop... \n",
+ "1 Does increasing the noise in data help to i... \n",
+ "2 \"Backprop\" is the same as \"backpropagation\"... \n",
+ "3 When you're writing your algorithm, how do ... \n",
+ "4 Given the following definition of an intell... \n",
+ "... ... \n",
+ "23174 The purpose of evaluating the state and act... \n",
+ "23175 In machine translation, convolution is a te... \n",
+ "23176 One of the key features of ChatGPT is its a... \n",
+ "23177 Given a neural network model for Covid-19 c... \n",
+ "23178 My question is more related to the fundamen... \n",
+ "\n",
+ " BodyClean \\\n",
+ "0 What does \"backprop\" mean? Is the \"backprop\" t... \n",
+ "1 Does increasing the noise in data help to impr... \n",
+ "2 \"Backprop\" is the same as \"backpropagation\": i... \n",
+ "3 When you're writing your algorithm, how do you... \n",
+ "4 Given the following definition of an intellige... \n",
+ "... ... \n",
+ "23174 The purpose of evaluating the state and action... \n",
+ "23175 In machine translation, convolution is a techn... \n",
+ "23176 One of the key features of ChatGPT is its abil... \n",
+ "23177 Given a neural network model for Covid-19 clas... \n",
+ "23178 My question is more related to the fundamental... \n",
+ "\n",
+ " Tags \\\n",
+ "0 23179 rows × 4 columns\n",
- " \n",
- "
\n",
- "\n",
- " \n",
- " \n",
- " \n",
- " \n",
- " Body \n",
- " BodyClean \n",
- " Tags \n",
- " TagsClean \n",
- " \n",
- " \n",
- " 0 \n",
- " <p>What does \"backprop\" mean? Is the \"backprop... \n",
- " What does \"backprop\" mean? Is the \"backprop\" t... \n",
- " <neural-networks><backpropagation><terminology... \n",
- " neural networks, backpropagation, terminology,... \n",
- " \n",
- " \n",
- " 1 \n",
- " <p>Does increasing the noise in data help to i... \n",
- " Does increasing the noise in data help to impr... \n",
- " <neural-networks><machine-learning><statistica... \n",
- " neural networks, machine learning, statistical... \n",
- " \n",
- " \n",
- " 2 \n",
- " <p>\"Backprop\" is the same as \"backpropagation\"... \n",
- " \"Backprop\" is the same as \"backpropagation\": i... \n",
- " NaN \n",
- " NaN \n",
- " \n",
- " \n",
- " 3 \n",
- " <p>When you're writing your algorithm, how do ... \n",
- " When you're writing your algorithm, how do you... \n",
- " <neural-networks><hyperparameter-optimization>... \n",
- " neural networks, hyperparameter optimization, ... \n",
- " \n",
- " \n",
- " 4 \n",
- " <p>Given the following definition of an intell... \n",
- " Given the following definition of an intellige... \n",
- " <philosophy><definitions><intelligent-agent> \n",
- " philosophy, definitions, intelligent agent \n",
- " \n",
- " \n",
- " ... \n",
- " ... \n",
- " ... \n",
- " ... \n",
- " ... \n",
- " \n",
- " \n",
- " 23174 \n",
- " <p>The purpose of evaluating the state and act... \n",
- " The purpose of evaluating the state and action... \n",
- " NaN \n",
- " NaN \n",
- " \n",
- " \n",
- " 23175 \n",
- " <p>In machine translation, convolution is a te... \n",
- " In machine translation, convolution is a techn... \n",
- " NaN \n",
- " NaN \n",
- " \n",
- " \n",
- " 23176 \n",
- " <p>One of the key features of ChatGPT is its a... \n",
- " One of the key features of ChatGPT is its abil... \n",
- " NaN \n",
- " NaN \n",
- " \n",
- " \n",
- " 23177 \n",
- " <p>Given a neural network model for Covid-19 c... \n",
- " Given a neural network model for Covid-19 clas... \n",
- " <neural-networks><homework> \n",
- " neural networks, homework \n",
- " \n",
- " \n",
- " \n",
- "23178 \n",
- " <p>My question is more related to the fundamen... \n",
- " My question is more related to the fundamental... \n",
- " <search><constraint-satisfaction-problems> \n",
- " search, constraint satisfaction problems \n",
- " \n",
- " \n",
- "
\n",
- "\n",
- " \n",
- " \n",
- " \n",
- " \n",
- " Id_q \n",
- " Question \n",
- " QuestionScore \n",
- " QuestionTags \n",
- " Id_a \n",
- " Answer \n",
- " AnswerScore \n",
- " AcceptedAnswerFlag \n",
- " \n",
- " \n",
- " 0 \n",
- " 1768 \n",
- " In Portal 2 we see that AI's can be \"killed\" b... \n",
- " 175 \n",
- " philosophy, decision theory, mythology of ai, ... \n",
- " 1769.0 \n",
- " This classic problem exhibits a basic misunder... \n",
- " 146.0 \n",
- " True \n",
- " \n",
- " \n",
- " 1 \n",
- " 10623 \n",
- " What is self-supervised learning in machine le... \n",
- " 91 \n",
- " machine learning, comparison, supervised learn... \n",
- " 10624.0 \n",
- " Introduction\\nThe term self-supervised learnin... \n",
- " 90.0 \n",
- " True \n",
- " \n",
- " \n",
- " 2 \n",
- " 111 \n",
- " Obviously, self-driving cars aren't perfect, s... \n",
- " 100 \n",
- " philosophy, ethics, autonomous vehicles, decis... \n",
- " 1790.0 \n",
- " \\nHow could self-driving cars make ethical dec... \n",
- " 76.0 \n",
- " True \n",
- " \n",
- " \n",
- " 3 \n",
- " 14224 \n",
- " If the original purpose for developing AI was ... \n",
- " 69 \n",
- " philosophy, social, explainable ai \n",
- " 14247.0 \n",
- " As argued by Selvaraju et al., there are three... \n",
- " 75.0 \n",
- " True \n",
- " \n",
- " \n",
- " \n",
- "4 \n",
- " 1479 \n",
- " Do scientists or research experts know from th... \n",
- " 94 \n",
- " neural networks, deep learning, convolutional ... \n",
- " 4044.0 \n",
- " There are many approaches that aim to make a t... \n",
- " 69.0 \n",
- " True \n",
- " \n",
- " \n",
- "
\n",
- "\n",
- " \n",
- " \n",
- " \n",
- " \n",
- " Id_q \n",
- " Question \n",
- " ParentId_a \n",
- " AcceptedAnswerId \n",
- " Id_a \n",
- " Answer \n",
- " AnswerScore \n",
- " AcceptedAnswerFlag \n",
- " \n",
- " \n",
- " 7 \n",
- " 15730 \n",
- " As a human being, we can think infinity. In pr... \n",
- " 15730.0 \n",
- " 15744 \n",
- " 15744.0 \n",
- " I think this is a fairly common misconception ... \n",
- " 62.0 \n",
- " True \n",
- " \n",
- " \n",
- " 3662 \n",
- " 15730 \n",
- " As a human being, we can think infinity. In pr... \n",
- " 15730.0 \n",
- " 15744 \n",
- " 15753.0 \n",
- " I think your premise is flawed.\\nYou seem to a... \n",
- " 19.0 \n",
- " False \n",
- " \n",
- " \n",
- " 3713 \n",
- " 15730 \n",
- " As a human being, we can think infinity. In pr... \n",
- " 15730.0 \n",
- " 15744 \n",
- " 15747.0 \n",
- " TL;DR: The subtleties of infinity are made app... \n",
- " 12.0 \n",
- " False \n",
- " \n",
- " \n",
- " 3788 \n",
- " 15730 \n",
- " As a human being, we can think infinity. In pr... \n",
- " 15730.0 \n",
- " 15744 \n",
- " 15756.0 \n",
- " In Haskell, you can type:\\nprint [1..]\\nand it... \n",
- " 9.0 \n",
- " False \n",
- " \n",
- " \n",
- " 3821 \n",
- " 15730 \n",
- " As a human being, we can think infinity. In pr... \n",
- " 15730.0 \n",
- " 15744 \n",
- " 15758.0 \n",
- " I believe humans can be said to understand inf... \n",
- " 8.0 \n",
- " False \n",
- " \n",
- " \n",
- " 3882 \n",
- " 15730 \n",
- " As a human being, we can think infinity. In pr... \n",
- " 15730.0 \n",
- " 15744 \n",
- " 15762.0 \n",
- " (There's a summary at the bottom for those who... \n",
- " 7.0 \n",
- " False \n",
- " \n",
- " \n",
- " 4389 \n",
- " 15730 \n",
- " As a human being, we can think infinity. In pr... \n",
- " 15730.0 \n",
- " 15744 \n",
- " 15783.0 \n",
- " Then premise assumes that humans \"understand\" ... \n",
- " 4.0 \n",
- " False \n",
- " \n",
- " \n",
- " 4849 \n",
- " 15730 \n",
- " As a human being, we can think infinity. In pr... \n",
- " 15730.0 \n",
- " 15744 \n",
- " 15740.0 \n",
- " By adding some rules for infinity in arithmeti... \n",
- " 3.0 \n",
- " False \n",
- " \n",
- " \n",
- " 4850 \n",
- " 15730 \n",
- " As a human being, we can think infinity. In pr... \n",
- " 15730.0 \n",
- " 15744 \n",
- " 15803.0 \n",
- " I think the concept that is missing in the dis... \n",
- " 3.0 \n",
- " False \n",
- " \n",
- " \n",
- " 5763 \n",
- " 15730 \n",
- " As a human being, we can think infinity. In pr... \n",
- " 15730.0 \n",
- " 15744 \n",
- " 15768.0 \n",
- " Computers don't understand \"infinity\" or even ... \n",
- " 2.0 \n",
- " False \n",
- " \n",
- " \n",
- " 5764 \n",
- " 15730 \n",
- " As a human being, we can think infinity. In pr... \n",
- " 15730.0 \n",
- " 15744 \n",
- " 15810.0 \n",
- " The Questions That Computers Can Never Answer ... \n",
- " 2.0 \n",
- " False \n",
- " \n",
- " \n",
- " 5765 \n",
- " 15730 \n",
- " As a human being, we can think infinity. In pr... \n",
- " 15730.0 \n",
- " 15744 \n",
- " 15943.0 \n",
- " John Doucette's answer covers my thoughts on t... \n",
- " 2.0 \n",
- " False \n",
- " \n",
- " \n",
- " 7462 \n",
- " 15730 \n",
- " As a human being, we can think infinity. In pr... \n",
- " 15730.0 \n",
- " 15744 \n",
- " 15779.0 \n",
- " I would think that a computer couldn’t underst... \n",
- " 1.0 \n",
- " False \n",
- " \n",
- " \n",
- " 7463 \n",
- " 15730 \n",
- " As a human being, we can think infinity. In pr... \n",
- " 15730.0 \n",
- " 15744 \n",
- " 15787.0 \n",
- " The \"concept\" of infinity is 1 thing to unders... \n",
- " 1.0 \n",
- " False \n",
- " \n",
- " \n",
- " 7464 \n",
- " 15730 \n",
- " As a human being, we can think infinity. In pr... \n",
- " 15730.0 \n",
- " 15744 \n",
- " 15801.0 \n",
- " Just food for thought: how about if we try to ... \n",
- " 1.0 \n",
- " False \n",
- " \n",
- " \n",
- " 7465 \n",
- " 15730 \n",
- " As a human being, we can think infinity. In pr... \n",
- " 15730.0 \n",
- " 15744 \n",
- " 15930.0 \n",
- " Its arguable if we humans understand infinity.... \n",
- " 1.0 \n",
- " False \n",
- " \n",
- " \n",
- " 7466 \n",
- " 15730 \n",
- " As a human being, we can think infinity. In pr... \n",
- " 15730.0 \n",
- " 15744 \n",
- " 15934.0 \n",
- " Well -- just to touch on the question of peopl... \n",
- " 1.0 \n",
- " False \n",
- " \n",
- " \n",
- " 7467 \n",
- " 15730 \n",
- " As a human being, we can think infinity. In pr... \n",
- " 15730.0 \n",
- " 15744 \n",
- " 15938.0 \n",
- " Humans certainly don't understand infinity. Cu... \n",
- " 1.0 \n",
- " False \n",
- " \n",
- " \n",
- " \n",
- "9481 \n",
- " 15730 \n",
- " As a human being, we can think infinity. In pr... \n",
- " 15730.0 \n",
- " 15744 \n",
- " 15931.0 \n",
- " I think the property humans have which compute... \n",
- " 0.0 \n",
- " False \n",
- " \n",
+ " \n",
+ "
\n",
+ "\n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " Id \n",
+ " PostTypeId \n",
+ " AcceptedAnswerId \n",
+ " CreationDate \n",
+ " Score \n",
+ " ViewCount \n",
+ " Body \n",
+ " OwnerUserId \n",
+ " LastEditorUserId \n",
+ " LastEditDate \n",
+ " ... \n",
+ " AnswerCount \n",
+ " CommentCount \n",
+ " ContentLicense \n",
+ " ParentId \n",
+ " ClosedDate \n",
+ " FavoriteCount \n",
+ " CommunityOwnedDate \n",
+ " LastEditorDisplayName \n",
+ " OwnerDisplayName \n",
+ " DataSource \n",
+ " \n",
+ " \n",
+ " 0 \n",
+ " 1 \n",
+ " 1 \n",
+ " 3 \n",
+ " 2016-08-02T15:39:14.947 \n",
+ " 10 \n",
+ " 710 \n",
+ " <p>What does \"backprop\" mean? Is the \"backprop... \n",
+ " 8 \n",
+ " 2444 \n",
+ " 2019-11-16T17:56:22.093 \n",
+ " ... \n",
+ " 5 \n",
+ " 0 \n",
+ " CC BY-SA 4.0 \n",
+ " 0 \n",
+ " NaN \n",
+ " NaN \n",
+ " NaN \n",
+ " NaN \n",
+ " NaN \n",
+ " https://ia600107.us.archive.org/view_archive.p... \n",
+ " \n",
+ " \n",
+ " 1 \n",
+ " 2 \n",
+ " 1 \n",
+ " 9 \n",
+ " 2016-08-02T15:40:20.623 \n",
+ " 14 \n",
+ " 1008 \n",
+ " <p>Does increasing the noise in data help to i... \n",
+ " 8 \n",
+ " 2444 \n",
+ " 2019-02-23T22:36:19.090 \n",
+ " ... \n",
+ " 3 \n",
+ " 0 \n",
+ " CC BY-SA 4.0 \n",
+ " 0 \n",
+ " NaN \n",
+ " NaN \n",
+ " NaN \n",
+ " NaN \n",
+ " NaN \n",
+ " https://ia600107.us.archive.org/view_archive.p... \n",
+ " \n",
+ " \n",
+ " 2 \n",
+ " 3 \n",
+ " 2 \n",
+ " 0 \n",
+ " 2016-08-02T15:40:24.820 \n",
+ " 15 \n",
+ " 0 \n",
+ " <p>\"Backprop\" is the same as \"backpropagation\"... \n",
+ " 4 \n",
+ " NaN \n",
+ " NaN \n",
+ " ... \n",
+ " 0 \n",
+ " 0 \n",
+ " CC BY-SA 3.0 \n",
+ " 1 \n",
+ " NaN \n",
+ " NaN \n",
+ " NaN \n",
+ " NaN \n",
+ " NaN \n",
+ " https://ia600107.us.archive.org/view_archive.p... \n",
+ " \n",
+ " \n",
+ " 3 \n",
+ " 4 \n",
+ " 1 \n",
+ " 12 \n",
+ " 2016-08-02T15:41:22.020 \n",
+ " 33 \n",
+ " 1266 \n",
+ " <p>When you're writing your algorithm, how do ... \n",
+ " 8 \n",
+ " 2444 \n",
+ " 2021-01-19T23:54:07.813 \n",
+ " ... \n",
+ " 4 \n",
+ " 0 \n",
+ " CC BY-SA 3.0 \n",
+ " 0 \n",
+ " NaN \n",
+ " NaN \n",
+ " NaN \n",
+ " NaN \n",
+ " NaN \n",
+ " https://ia600107.us.archive.org/view_archive.p... \n",
+ " \n",
+ " \n",
+ " \n",
+ "4 \n",
+ " 6 \n",
+ " 1 \n",
+ " 20 \n",
+ " 2016-08-02T15:43:35.460 \n",
+ " 7 \n",
+ " 279 \n",
+ " <p>Given the following definition of an intell... \n",
+ " 29 \n",
+ " 2444 \n",
+ " 2019-06-15T18:25:58.513 \n",
+ " ... \n",
+ " 2 \n",
+ " 0 \n",
+ " CC BY-SA 4.0 \n",
+ " 0 \n",
+ " NaN \n",
+ " NaN \n",
+ " NaN \n",
+ " NaN \n",
+ " NaN \n",
+ " https://ia600107.us.archive.org/view_archive.p... \n",
+ " \n",
+ " \n",
+ "
\n",
+ "\n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " Body \n",
+ " BodyClean \n",
+ " Tags \n",
+ " TagsClean \n",
+ " \n",
+ " \n",
+ " 0 \n",
+ " <p>What does \"backprop\" mean? Is the \"backprop... \n",
+ " What does \"backprop\" mean? Is the \"backprop\" t... \n",
+ " <neural-networks><backpropagation><terminology... \n",
+ " neural networks, backpropagation, terminology,... \n",
+ " \n",
+ " \n",
+ " 1 \n",
+ " <p>Does increasing the noise in data help to i... \n",
+ " Does increasing the noise in data help to impr... \n",
+ " <neural-networks><machine-learning><statistica... \n",
+ " neural networks, machine learning, statistical... \n",
+ " \n",
+ " \n",
+ " 2 \n",
+ " <p>\"Backprop\" is the same as \"backpropagation\"... \n",
+ " \"Backprop\" is the same as \"backpropagation\": i... \n",
+ " NaN \n",
+ " NaN \n",
+ " \n",
+ " \n",
+ " 3 \n",
+ " <p>When you're writing your algorithm, how do ... \n",
+ " When you're writing your algorithm, how do you... \n",
+ " <neural-networks><hyperparameter-optimization>... \n",
+ " neural networks, hyperparameter optimization, ... \n",
+ " \n",
+ " \n",
+ " 4 \n",
+ " <p>Given the following definition of an intell... \n",
+ " Given the following definition of an intellige... \n",
+ " <philosophy><definitions><intelligent-agent> \n",
+ " philosophy, definitions, intelligent agent \n",
+ " \n",
+ " \n",
+ " ... \n",
+ " ... \n",
+ " ... \n",
+ " ... \n",
+ " ... \n",
+ " \n",
+ " \n",
+ " 23174 \n",
+ " <p>The purpose of evaluating the state and act... \n",
+ " The purpose of evaluating the state and action... \n",
+ " NaN \n",
+ " NaN \n",
+ " \n",
+ " \n",
+ " 23175 \n",
+ " <p>In machine translation, convolution is a te... \n",
+ " In machine translation, convolution is a techn... \n",
+ " NaN \n",
+ " NaN \n",
+ " \n",
+ " \n",
+ " 23176 \n",
+ " <p>One of the key features of ChatGPT is its a... \n",
+ " One of the key features of ChatGPT is its abil... \n",
+ " NaN \n",
+ " NaN \n",
+ " \n",
+ " \n",
+ " 23177 \n",
+ " <p>Given a neural network model for Covid-19 c... \n",
+ " Given a neural network model for Covid-19 clas... \n",
+ " <neural-networks><homework> \n",
+ " neural networks, homework \n",
+ " \n",
+ " \n",
+ " \n",
+ "23178 \n",
+ " <p>My question is more related to the fundamen... \n",
+ " My question is more related to the fundamental... \n",
+ " <search><constraint-satisfaction-problems> \n",
+ " search, constraint satisfaction problems \n",
+ " \n",
+ " \n",
+ "
\n",
+ "\n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " Id_q \n",
+ " Question \n",
+ " QuestionScore \n",
+ " QuestionTags \n",
+ " Id_a \n",
+ " Answer \n",
+ " AnswerScore \n",
+ " AcceptedAnswerFlag \n",
+ " \n",
+ " \n",
+ " 0 \n",
+ " 1768 \n",
+ " In Portal 2 we see that AI's can be \"killed\" b... \n",
+ " 175 \n",
+ " philosophy, decision theory, mythology of ai, ... \n",
+ " 1769.0 \n",
+ " This classic problem exhibits a basic misunder... \n",
+ " 146.0 \n",
+ " True \n",
+ " \n",
+ " \n",
+ " 1 \n",
+ " 10623 \n",
+ " What is self-supervised learning in machine le... \n",
+ " 91 \n",
+ " machine learning, comparison, supervised learn... \n",
+ " 10624.0 \n",
+ " Introduction\\nThe term self-supervised learnin... \n",
+ " 90.0 \n",
+ " True \n",
+ " \n",
+ " \n",
+ " 2 \n",
+ " 111 \n",
+ " Obviously, self-driving cars aren't perfect, s... \n",
+ " 100 \n",
+ " philosophy, ethics, autonomous vehicles, decis... \n",
+ " 1790.0 \n",
+ " \\nHow could self-driving cars make ethical dec... \n",
+ " 76.0 \n",
+ " True \n",
+ " \n",
+ " \n",
+ " 3 \n",
+ " 14224 \n",
+ " If the original purpose for developing AI was ... \n",
+ " 69 \n",
+ " philosophy, social, explainable ai \n",
+ " 14247.0 \n",
+ " As argued by Selvaraju et al., there are three... \n",
+ " 75.0 \n",
+ " True \n",
+ " \n",
+ " \n",
+ " \n",
+ "4 \n",
+ " 1479 \n",
+ " Do scientists or research experts know from th... \n",
+ " 94 \n",
+ " neural networks, deep learning, convolutional ... \n",
+ " 4044.0 \n",
+ " There are many approaches that aim to make a t... \n",
+ " 69.0 \n",
+ " True \n",
+ " \n",
+ " \n",
+ "
\n",
+ "\n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ " Id_q \n",
+ " Question \n",
+ " ParentId_a \n",
+ " AcceptedAnswerId \n",
+ " Id_a \n",
+ " Answer \n",
+ " AnswerScore \n",
+ " AcceptedAnswerFlag \n",
+ " \n",
+ " \n",
+ " 7 \n",
+ " 15730 \n",
+ " As a human being, we can think infinity. In pr... \n",
+ " 15730.0 \n",
+ " 15744 \n",
+ " 15744.0 \n",
+ " I think this is a fairly common misconception ... \n",
+ " 62.0 \n",
+ " True \n",
+ " \n",
+ " \n",
+ " 3662 \n",
+ " 15730 \n",
+ " As a human being, we can think infinity. In pr... \n",
+ " 15730.0 \n",
+ " 15744 \n",
+ " 15753.0 \n",
+ " I think your premise is flawed.\\nYou seem to a... \n",
+ " 19.0 \n",
+ " False \n",
+ " \n",
+ " \n",
+ " 3713 \n",
+ " 15730 \n",
+ " As a human being, we can think infinity. In pr... \n",
+ " 15730.0 \n",
+ " 15744 \n",
+ " 15747.0 \n",
+ " TL;DR: The subtleties of infinity are made app... \n",
+ " 12.0 \n",
+ " False \n",
+ " \n",
+ " \n",
+ " 3788 \n",
+ " 15730 \n",
+ " As a human being, we can think infinity. In pr... \n",
+ " 15730.0 \n",
+ " 15744 \n",
+ " 15756.0 \n",
+ " In Haskell, you can type:\\nprint [1..]\\nand it... \n",
+ " 9.0 \n",
+ " False \n",
+ " \n",
+ " \n",
+ " 3821 \n",
+ " 15730 \n",
+ " As a human being, we can think infinity. In pr... \n",
+ " 15730.0 \n",
+ " 15744 \n",
+ " 15758.0 \n",
+ " I believe humans can be said to understand inf... \n",
+ " 8.0 \n",
+ " False \n",
+ " \n",
+ " \n",
+ " 3882 \n",
+ " 15730 \n",
+ " As a human being, we can think infinity. In pr... \n",
+ " 15730.0 \n",
+ " 15744 \n",
+ " 15762.0 \n",
+ " (There's a summary at the bottom for those who... \n",
+ " 7.0 \n",
+ " False \n",
+ " \n",
+ " \n",
+ " 4389 \n",
+ " 15730 \n",
+ " As a human being, we can think infinity. In pr... \n",
+ " 15730.0 \n",
+ " 15744 \n",
+ " 15783.0 \n",
+ " Then premise assumes that humans \"understand\" ... \n",
+ " 4.0 \n",
+ " False \n",
+ " \n",
+ " \n",
+ " 4849 \n",
+ " 15730 \n",
+ " As a human being, we can think infinity. In pr... \n",
+ " 15730.0 \n",
+ " 15744 \n",
+ " 15740.0 \n",
+ " By adding some rules for infinity in arithmeti... \n",
+ " 3.0 \n",
+ " False \n",
+ " \n",
+ " \n",
+ " 4850 \n",
+ " 15730 \n",
+ " As a human being, we can think infinity. In pr... \n",
+ " 15730.0 \n",
+ " 15744 \n",
+ " 15803.0 \n",
+ " I think the concept that is missing in the dis... \n",
+ " 3.0 \n",
+ " False \n",
+ " \n",
+ " \n",
+ " 5763 \n",
+ " 15730 \n",
+ " As a human being, we can think infinity. In pr... \n",
+ " 15730.0 \n",
+ " 15744 \n",
+ " 15768.0 \n",
+ " Computers don't understand \"infinity\" or even ... \n",
+ " 2.0 \n",
+ " False \n",
+ " \n",
+ " \n",
+ " 5764 \n",
+ " 15730 \n",
+ " As a human being, we can think infinity. In pr... \n",
+ " 15730.0 \n",
+ " 15744 \n",
+ " 15810.0 \n",
+ " The Questions That Computers Can Never Answer ... \n",
+ " 2.0 \n",
+ " False \n",
+ " \n",
+ " \n",
+ " 5765 \n",
+ " 15730 \n",
+ " As a human being, we can think infinity. In pr... \n",
+ " 15730.0 \n",
+ " 15744 \n",
+ " 15943.0 \n",
+ " John Doucette's answer covers my thoughts on t... \n",
+ " 2.0 \n",
+ " False \n",
+ " \n",
+ " \n",
+ " 7462 \n",
+ " 15730 \n",
+ " As a human being, we can think infinity. In pr... \n",
+ " 15730.0 \n",
+ " 15744 \n",
+ " 15779.0 \n",
+ " I would think that a computer couldn’t underst... \n",
+ " 1.0 \n",
+ " False \n",
+ " \n",
+ " \n",
+ " 7463 \n",
+ " 15730 \n",
+ " As a human being, we can think infinity. In pr... \n",
+ " 15730.0 \n",
+ " 15744 \n",
+ " 15787.0 \n",
+ " The \"concept\" of infinity is 1 thing to unders... \n",
+ " 1.0 \n",
+ " False \n",
+ " \n",
+ " \n",
+ " 7464 \n",
+ " 15730 \n",
+ " As a human being, we can think infinity. In pr... \n",
+ " 15730.0 \n",
+ " 15744 \n",
+ " 15801.0 \n",
+ " Just food for thought: how about if we try to ... \n",
+ " 1.0 \n",
+ " False \n",
+ " \n",
+ " \n",
+ " 7465 \n",
+ " 15730 \n",
+ " As a human being, we can think infinity. In pr... \n",
+ " 15730.0 \n",
+ " 15744 \n",
+ " 15930.0 \n",
+ " Its arguable if we humans understand infinity.... \n",
+ " 1.0 \n",
+ " False \n",
+ " \n",
+ " \n",
+ " 7466 \n",
+ " 15730 \n",
+ " As a human being, we can think infinity. In pr... \n",
+ " 15730.0 \n",
+ " 15744 \n",
+ " 15934.0 \n",
+ " Well -- just to touch on the question of peopl... \n",
+ " 1.0 \n",
+ " False \n",
+ " \n",
+ " \n",
+ " 7467 \n",
+ " 15730 \n",
+ " As a human being, we can think infinity. In pr... \n",
+ " 15730.0 \n",
+ " 15744 \n",
+ " 15938.0 \n",
+ " Humans certainly don't understand infinity. Cu... \n",
+ " 1.0 \n",
+ " False \n",
+ " \n",
+ " \n",
+ " \n",
+ "9481 \n",
+ " 15730 \n",
+ " As a human being, we can think infinity. In pr... \n",
+ " 15730.0 \n",
+ " 15744 \n",
+ " 15931.0 \n",
+ " I think the property humans have which compute... \n",
+ " 0.0 \n",
+ " False \n",
+ "