revised README.md

decoupled
2023-09-11 21:04:51 +03:00 · 2023-09-11 20:35:55 +03:00
8 changed files with 135 additions and 158 deletions
--- a/README.md
+++ b/README.md
@@ -1,67 +1,76 @@
-# AnkiAI
+# AnkiAI - Automated Anki Deck Creator

-AnkiAI is a robust system that converts images containing text into structured Anki cards using Optical Character Recognition (OCR) and OpenAI's GPT-4 language model. Users can quickly generate decks of flashcards from their images for effective study.
+AnkiAI is a tool that leverages OCR (Optical Character Recognition) and GPT-3's powerful natural language processing capabilities to automatically generate Anki decks from images containing text.

-## Features
- Converts image content to textual content using OCR.
- Uses OpenAI's GPT-4 model to structure the content into Anki decks and cards.
- Outputs the structured content as an Anki package.
+### Overview

-## Dependencies
- genanki: Used for creating Anki decks and cards.
- Pillow: Image processing library.
- openai: API library for OpenAI's GPT-4 model.
- flask: Web server to host the service.
+- AnkiAI is designed to streamline the process of creating Anki decks from images.
+- The core idea is to use OCR to extract text from images and then use GPT-3 to transform this text into a structured Anki deck format.
+- Users can make a POST request to a Flask server endpoint with their images to receive the Anki deck (.apkg file).

-## Setup and Installation
+### Directory Structure
+
+- `.vscode/`: Contains configuration for VSCode debugger for Flask applications.
+- `ankiai.py`: The main script that drives the creation of Anki decks from images.
+- `constants.py`: Contains constant variables used across the project.
+- `deck_creation.py`: Contains logic for communicating with OpenAI's API and deck creation using genanki.
+- `image_processing.py`: Processes images, converting them for OCR and then performing OCR to extract text.
+- `logging_config.py`: Logging configuration for the entire project.
+- `server.py`: Flask server that provides an API endpoint to upload images and get back an Anki deck.
+
+### Requirements
+
+To run AnkiAI, you'll need to have the following dependencies installed:
+
+```
+genanki==0.8.0
+Pillow
+openai
+flask
+```
+
+You can install these via `pip` using the `requirements.txt` file:
+
+```bash
+pip install -r requirements.txt
+```
+
+### How to Run
+
+1. **Environment Variables**: Make sure to set the `OPENAI_API_KEY` environment variable to your OpenAI API key.
+   
+2. **Run the Flask server**:

-1. Clone this repository:
   ```bash
-   git clone https://git.rudefox.io/bj/anki-json2ankicards.git
-   cd json2ankicards
+   python server.py
   ```

-2. Set up a virtual environment and activate it:
+   This will start the Flask server. You can then make a POST request to `http://localhost:5000/deck-from-images` with your images to get an Anki deck.
+
+3. **Run Directly**:
+
+   If you prefer not to use the Flask server, you can also run `ankiai.py` directly:
+
   ```bash
-   python3 -m venv venv
-   source venv/bin/activate
+   python ankiai.py <directory_path_containing_images>
   ```

-3. Install the required packages:
-   ```bash
-   pip install -r requirements.txt
-   ```
+### How to Debug (VSCode Users)

-4. Set up the OpenAI API key:
-    ```bash
-    export OPENAI_API_KEY=your_openai_api_key
-    ```
+- Open the project in VSCode.
+- Set up your breakpoints.
+- Use the VSCode debugger and select "Python: Flask" to start debugging the Flask server.

-5. Run the server:
-    ```bash
-    python server.py
-    ```
+### Important Notes

-## Usage
+- **API Key**: For the project to work, it is essential to have the `OPENAI_API_KEY` environment variable set.
+- **Image Types**: Currently, the image processing module supports PNG, JPG, and JPEG formats.
+- **Output**: The output `.apkg` file (Anki package file) will be named `out.apkg`.

-1. Start the server as mentioned above.
+### Acknowledgements

-2. Use a tool like [Postman](https://www.postman.com/) or `curl` to send images to `http://localhost:5000/deck-from-images` as a multi-part POST request.
+This project heavily relies on the `openai` library for processing and the `genanki` library for deck generation.

-3. The server will respond with a downloadable Anki package. Import this into your Anki app and start studying!
+### Contributions

-## Modules
-
-1. **ankiai.py**: The main module that orchestrates the flow.
-2. **images2text.py**: Converts image content into text using OCR.
-3. **json2deck.py**: Converts structured JSON data into an Anki package.
-4. **prompt4cards.py**: Uses OpenAI to structure the content into Anki decks and cards.
-5. **server.py**: Flask server to host the service.
-
-## Contributing
-
-Contributions are welcome! Please submit a pull request or open an issue to discuss changes or fixes.
-
-## License
-
-[MIT License](LICENSE)
+Contributions are always welcome. Please create a new issue or a pull request for any bug fixes or feature requests.
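
Reviewer note: the revised README says to POST images to `http://localhost:5000/deck-from-images` but no longer mentions the multipart request the old Usage section described. The sketch below is one possible client using only the standard library. It assumes the form field is named `image` (the `IMAGE_KEY` value added to `constants.py` in this change), that the server accepts several file parts under that field, and that the response body is the `.apkg` file. `build_multipart` and `request_deck` are hypothetical helper names, not part of the repository.

```python
import mimetypes
import os
import urllib.request
import uuid


def build_multipart(field_name, files):
    """Encode (filename, bytes) pairs as a multipart/form-data body.

    Returns (body_bytes, content_type_header_value).
    """
    boundary = uuid.uuid4().hex
    parts = []
    for filename, data in files:
        mime = mimetypes.guess_type(filename)[0] or "application/octet-stream"
        header = (
            f"--{boundary}\r\n"
            f'Content-Disposition: form-data; name="{field_name}"; '
            f'filename="{filename}"\r\n'
            f"Content-Type: {mime}\r\n\r\n"
        )
        parts.append(header.encode("utf-8") + data + b"\r\n")
    # Closing delimiter ends the multipart body.
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def request_deck(url, image_paths, out_path="out.apkg"):
    """POST the images and save the returned Anki package to out_path."""
    files = []
    for path in image_paths:
        with open(path, "rb") as f:
            files.append((os.path.basename(path), f.read()))
    # "image" is an assumed field name, taken from IMAGE_KEY in constants.py.
    body, content_type = build_multipart("image", files)
    req = urllib.request.Request(
        url, data=body, headers={"Content-Type": content_type}, method="POST"
    )
    with urllib.request.urlopen(req) as resp, open(out_path, "wb") as out:
        out.write(resp.read())
    return out_path
```

With the server running, a call such as `request_deck("http://localhost:5000/deck-from-images", ["page1.png", "page2.jpg"])` would save the response next to the script; the file names here are placeholders.
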
--- a/ankiai.py
+++ b/ankiai.py
@@ -2,18 +2,18 @@ import sys
 import logging

 from logging_config import setup_logging
-from images2text import main as ocr_images
-from prompt4cards import prompt_for_card_content, response_to_json
-from json2deck import to_package
+from image_processing import process_images
+from deck_creation import prompt_for_card_content, response_to_json, to_package
+
+APKG_FILE = "out.apkg"

 setup_logging()

-def images_to_package(directory_path, outfile):
-    ocr_text = ocr_images(directory_path)
+def images_to_package(directory_path):
+    ocr_text = process_images(directory_path)
    response_text = prompt_for_card_content(ocr_text)
    deck_json = response_to_json(response_text)
-    to_package(deck_json).write_to_file(outfile)
-    logging.info(f"Deck created at: {outfile}")
+    return to_package(deck_json)


 if __name__ == "__main__":
@@ -21,4 +21,5 @@ if __name__ == "__main__":
        print("Usage: python ankiai.py <directory_path_containing_images>")
        sys.exit(1)

-    images_to_package(sys.argv[1])
+    images_to_package(sys.argv[1]).write_to_file(APKG_FILE)
+    logging.info(f"Deck created at: {APKG_FILE}")
--- a/constants.py
+++ b/constants.py
@@ -1,8 +1,10 @@
 # File and Directory Constants
+IMAGE_KEY="image"
+APKG_FILE="out.apkg"
 CONVERTED_DIR = "converted"
-FINAL_OUTPUT = "final.txt"
+TEXT_OCR_FILE = "final.txt"
 IMAGE_EXTENSIONS = ['.png', '.jpg', '.jpeg']
-OUTPUT_FILENAME = "output_deck.json"
+DECK_JSON_FILE = "output_deck.json"

 # API Constants
 API_KEY_ENV = "OPENAI_API_KEY"
--- a/deck_creation.py
+++ b/deck_creation.py
@@ -1,8 +1,12 @@
 import openai
 import os
-import sys
 import json
-from constants import API_KEY_ENV, CHAT_MODEL, OUTPUT_FILENAME
+import genanki
+from logging_config import setup_logging
+from constants import API_KEY_ENV, CHAT_MODEL
+
+
+setup_logging()


 API_KEY = os.environ.get(API_KEY_ENV)
@@ -83,21 +87,44 @@ def response_to_json(response_text):
    }


-if __name__ == "__main__":
-    if len(sys.argv) != 2:
-        print("Usage: python prompt4cards.py <text_file_path>")
-        sys.exit(1)
-    
-    text_file_path = sys.argv[1]
+# Create a new model for our cards. This is necessary for genanki.
+MY_MODEL = genanki.Model(
+    1607372319,
+    "Simple Model",
+    fields=[
+        {"name": "Title"},
+        {"name": "Question"},
+        {"name": "Answer"},
+    ],
+    templates=[
+        {
+            "name": "{{Title}}",
+            "qfmt": "{{Question}}",
+            "afmt": "{{FrontSide}}<hr id='answer'>{{Answer}}",
+        },
+    ])

-    # Read the text content
-    with open(text_file_path, 'r') as file:
-        text_content = file.read()
+def json_file_to_package(json_path):
+    with open(json_path, 'r', encoding='utf-8') as f:
+        json_data = json.load(f)
+        package = to_package(json_data)

-    response_text = prompt_for_card_content(text_content)
-    deck_json = response_to_json(response_text)
+    return package

-    with open(OUTPUT_FILENAME, 'w') as json_file:
-        json.dump(deck_json, json_file)
+def to_package(deck_json):
+    deck_title = deck_json["DeckTitle"]
+    deck = genanki.Deck(1607372319, deck_title)

-    print(f"Saved generated deck to {OUTPUT_FILENAME}")
+    for card_json in deck_json["Cards"]:
+        title = card_json["Title"]
+        question = card_json["Question"]
+        answer = card_json["Answer"]
+
+        note = genanki.Note(
+                model=MY_MODEL,
+                fields=[title, question, answer]
+            )
+
+        deck.add_note(note)
+
+    return genanki.Package(deck)
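
Reviewer note: `to_package` indexes `deck_json["DeckTitle"]` and, for each entry in `deck_json["Cards"]`, the keys `"Title"`, `"Question"`, and `"Answer"`, so a model response missing any of them surfaces as a bare `KeyError`. The sketch below spells out the shape those lookups imply and adds a hypothetical `validate_deck_json` helper (not in the repository) that reports every missing key at once. It uses only the standard library and does not call genanki; the example deck is illustrative data, not output from the project.

```python
REQUIRED_CARD_KEYS = ("Title", "Question", "Answer")


def validate_deck_json(deck_json):
    """Return a list of problems; an empty list means the shape is usable."""
    if not isinstance(deck_json, dict):
        return ["deck must be a JSON object"]
    problems = []
    if "DeckTitle" not in deck_json:
        problems.append("missing key: DeckTitle")
    cards = deck_json.get("Cards")
    if not isinstance(cards, list):
        problems.append("Cards must be a list")
        return problems
    for i, card in enumerate(cards):
        if not isinstance(card, dict):
            problems.append(f"Cards[{i}] must be an object")
            continue
        for key in REQUIRED_CARD_KEYS:
            if key not in card:
                problems.append(f"Cards[{i}] missing key: {key}")
    return problems


# Illustrative input with the keys to_package reads.
example = {
    "DeckTitle": "Cell Biology",
    "Cards": [
        {
            "Title": "Mitochondria",
            "Question": "What molecule does it mainly produce?",
            "Answer": "ATP",
        },
    ],
}

if __name__ == "__main__":
    print(validate_deck_json(example))
```

Calling a check like this between `response_to_json` and `to_package` would let the server return a descriptive error instead of a stack trace, though where to place it is a design choice for the author.
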
--- a/image_processing.py
+++ b/image_processing.py
@@ -5,13 +5,21 @@ import logging
 from logging_config import setup_logging
 from subprocess import run, CalledProcessError
 from concurrent.futures import ThreadPoolExecutor
-from utilities import is_image_file, ensure_directory_exists
-from constants import CONVERTED_DIR, FINAL_OUTPUT
+from constants import CONVERTED_DIR, TEXT_OCR_FILE, IMAGE_EXTENSIONS


 setup_logging()


+def is_image_file(path):
+    return any(path.lower().endswith(ext) for ext in IMAGE_EXTENSIONS)
+
+
+def ensure_directory_exists(directory):
+    if not os.path.exists(directory):
+        os.mkdir(directory)
+
+
 def convert_image(image_path):
    logging.info(f"Converting {image_path}...")
    converted_path = os.path.join(CONVERTED_DIR, os.path.basename(image_path))
@@ -62,7 +70,7 @@ def process_image(image_path):
        return None


-def main(directory_path):
+def process_images(directory_path):
    final_text = []

    ensure_directory_exists(CONVERTED_DIR)
@@ -80,10 +88,10 @@ def main(directory_path):
    
    # Filter out any None values and write the text to final.txt
    final_text = [text for text in final_text if text is not None]
-    with open(FINAL_OUTPUT, 'w') as f:
+    with open(TEXT_OCR_FILE, 'w') as f:
        f.write("\n".join(final_text))

-    logging.info(f"All images processed! Final output saved to {FINAL_OUTPUT}")
+    logging.info(f"All images processed! Final output saved to {TEXT_OCR_FILE}")
    return final_text  # Add this line


@@ -91,4 +99,4 @@ if __name__ == "__main__":
    if len(sys.argv) != 2:
        print("Usage: python images2text.py <directory_path>")
        sys.exit(1)
-    main(sys.argv[1])
+    process_images(sys.argv[1])
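
Reviewer note: the two helpers moved in from `utilities.py` are small enough to exercise on their own. The sketch copies `is_image_file` as it appears in this diff and shows a variant of `ensure_directory_exists` built on `os.makedirs(..., exist_ok=True)`, which also creates missing parent directories and does not fail if the directory appears between the existence check and the call. The variant is a suggestion, not what the commit does.

```python
import os
import tempfile

IMAGE_EXTENSIONS = ['.png', '.jpg', '.jpeg']  # same values as constants.py


def is_image_file(path):
    # Case-insensitive suffix match, as in image_processing.py.
    return any(path.lower().endswith(ext) for ext in IMAGE_EXTENSIONS)


def ensure_directory_exists(directory):
    # Suggested variant: idempotent, and creates intermediate directories.
    os.makedirs(directory, exist_ok=True)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as root:
        target = os.path.join(root, "converted", "batch1")
        ensure_directory_exists(target)
        ensure_directory_exists(target)  # second call does nothing
        print(os.path.isdir(target))
    print(is_image_file("scan.JPG"), is_image_file("notes.txt"))
```
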
--- a/json2deck.py
+++ b/json2deck.py
@@ -1,61 +0,0 @@
-import json
-import genanki
-import sys
-import logging
-from logging_config import setup_logging
-
-
-setup_logging()
-
-
-# Create a new model for our cards. This is necessary for genanki.
-MY_MODEL = genanki.Model(
-    1607372319,
-    "Simple Model",
-    fields=[
-        {"name": "Title"},
-        {"name": "Question"},
-        {"name": "Answer"},
-    ],
-    templates=[
-        {
-            "name": "{{Title}}",
-            "qfmt": "{{Question}}",
-            "afmt": "{{FrontSide}}<hr id='answer'>{{Answer}}",
-        },
-    ])
-
-def json_file_to_package(json_path):
-    with open(json_path, 'r', encoding='utf-8') as f:
-        json_data = json.load(f)
-        package = to_package(json_data)
-
-    return package
-
-def to_package(deck_json):
-    deck_title = deck_json["DeckTitle"]
-    deck = genanki.Deck(1607372319, deck_title)
-
-    for card_json in deck_json["Cards"]:
-        title = card_json["Title"]
-        question = card_json["Question"]
-        answer = card_json["Answer"]
-
-        note = genanki.Note(
-                model=MY_MODEL,
-                fields=[title, question, answer]
-            )
-
-        deck.add_note(note)
-
-    return genanki.Package(deck)
-
-if __name__ == "__main__":
-    if len(sys.argv) != 3:
-        print("Usage: python convert.py <input_json> <output_apkg>")
-        sys.exit(1)
-    
-    input_json = sys.argv[1]
-    output_apkg = sys.argv[2]
-    json_file_to_package(input_json).write_to_file(output_apkg)
-    logging.info(f"Deck created at: {output_apkg}")
--- a/server.py
+++ b/server.py
@@ -3,17 +3,16 @@ import tempfile
 import shutil
 import logging

-from logging_config import setup_logging
 from flask import Flask, request, send_from_directory, jsonify
 from werkzeug.utils import secure_filename
 from ankiai import images_to_package
-from constants import IMAGE_KEY, OUTPUT_FILE, NO_IMAGE_PART_ERROR, NO_SELECTED_FILE_ERROR, INVALID_FILENAME_ERROR
-
-
-setup_logging()
+from constants import IMAGE_KEY, APKG_FILE, NO_IMAGE_PART_ERROR, NO_SELECTED_FILE_ERROR, INVALID_FILENAME_ERROR


 from logging_config import setup_logging
+setup_logging()
+
+
 app = Flask(__name__)

 def save_uploaded_images(images, directory):
@@ -41,8 +40,9 @@ def deck_from_images():
    save_uploaded_images(images, temp_dir)

    try:
-        images_to_package(temp_dir, OUTPUT_FILE)
-        return send_from_directory('.', OUTPUT_FILE, as_attachment=True)
+        images_to_package(temp_dir).write_to_file(APKG_FILE)
+        logging.info(f"Anki package written to {APKG_FILE}")
+        return send_from_directory('.', APKG_FILE, as_attachment=True)
    except Exception as e:
        logging.error("Exception occurred: "+str(e), exc_info=True)
        return jsonify({'error': str(e)}), 500
--- a/utilities.py
+++ b/utilities.py
@@ -1,9 +0,0 @@
-import os
-from constants import IMAGE_EXTENSIONS
-
-def is_image_file(path):
-    return any(path.lower().endswith(ext) for ext in IMAGE_EXTENSIONS)
-
-def ensure_directory_exists(directory):
-    if not os.path.exists(directory):
-        os.mkdir(directory)
Author	SHA1	Message	Date
Benjamin Dweck	e96f23ddc8	revised README.md	2023-09-11 21:04:51 +03:00
Benjamin Dweck	28e6c8d611	decoupled	2023-09-11 20:35:55 +03:00