在线时间:8:00-16:00
迪恩网络APP
随时随地掌握行业动态
扫描二维码
关注迪恩网络微信公众号
开源软件名称:blueprints-for-text-analytics-python/blueprints-text开源软件地址:https://github.com/blueprints-for-text-analytics-python/blueprints-text开源编程语言:Jupyter Notebook 75.4%开源软件介绍:Blueprints for Text Analytics Using PythonMachine Learning Based Solutions for Common Real World (NLP) ApplicationsJens Albrecht, Sidharth Ramachandran, Christian Winkler Published by O'Reilly, 2020 Find the book at If you like the book or the code examples here, please leave a friendly comment on Amazon! Download your free chapter now!Free download of Chapter 7 "How to Explain a Classifier". Content of this RepositoryThis repository is currently in preparation. Please do not yet send any comments. This repository contains the code examples of our O'Reilly book. You will find a subdirectory for each chapter containing a Jupyter notebook and additional files for the setup. Below you find the links to view the notebooks here on Github or execute them directly on Google Colab. In the section thereafter you will find instructions to setup the environment on your local computer. Problems and errorsIf you discover any problems or have recommendations on how to improve the code, do not hesitate to create an issue here in the repository. For errors in the book text, please use O'Reilly's errata page. spaCy 3.0 and Gensim 4.0 The book uses spaCy 2.3.2 and gensim 3.8.3. spaCy 3.0 is now officially release with several new features and a few API changes (https://spacy.io/usage/v3). Gensim 4.0 is in beta (https://github.com/RaRe-Technologies/gensim/releases). We are already updating our notebooks. But currently textacy is not yet supporting spaCy 3.0, although work is already in progress (see this pull request from us). Until textacy for spaCy 3.0 is released, you can use our own fork for the installation (see blueprints.yaml in this directory). View or Run the NotebooksFor each chapter of the book we provide three links:
If you run the notebook locally or on Colab, you can execute each cell separately by hitting "Shift-enter". Do not skip cells and don't forget to run the first code cells for the setup.
Local SetupThe following instructions should work on Linux, Windows and MacOS. If you are a Windows user familiar with Linux, you should check out the Windows Subsystem for Linux, Version 2 (WSL2). This allows to use a Linux system on the Windows machine. However, using native Windows should also be no problem. It is helpful to install git clone https://github.com/blueprints-for-text-analytics-python/blueprints-text.git
cd blueprints-text Otherwise download the zip file, unpack it to a location convenient to you, and open a command line terminal in the project directory For local setup, we recommend to use Miniconda, a minimal version of the popular Anaconda distribution that contains only the package manager After installation of Anaconda/Miniconda, run the following command(s) from the project directory: conda env create --name blueprints --file blueprints.yml
conda activate blueprints The prompt should change after activation and indicate that you are working in the jupyter nbextension enable toc2/main
jupyter nbextension enable execute_time/ExecuteTime
jupyter nbextension enable varInspector/main Now you can start the Jupyter notebook server: jupyter notebook If working on WSL under Windows, add Browse to the respective chapter and open the notebook file (suffix |
2023-10-27
2022-08-15
2022-08-17
2022-09-23
2022-08-13
请发表评论