Pinokio

From Notes_Wiki
Revision as of 07:07, 22 December 2025 by Saurabh (talk | contribs)

Home > Local system based AI tools > Pinokio

Install Pinoko AppImage

To download and install Pinokio on local system use:

  1. Go to https://github.com/pinokiocomputer/pinokio/releases
  2. Download AppImage and run via AppImageLauncher. See Rocky 9.x Owncloud client via AppImage
  3. Then run pinokio via graphical run options.
    1. For this in Rocky 9.x use:
      chmod +x Pinokio.AppImage
      ./Piniokio.AppImage
    2. For this in Ubuntu 24.04 use:
      chmod +x Pinokio.AppImage
      ./Pinoko.AppImage --no-sandbox
  4. During first run set a parent folder in location where you have enough space (at least 300GB+)
  5. Install recommended tools from Overall list of useful AI tools

Refer: - https://program.pinokio.computer/#/?id=linux


Useful tools within Pinokio

Dia

From https://github.com/nari-labs/dia We can use dia for text to speech generation. It also has option for speech to text.

Installation

After installing dia via pinokio one click it may not work. Use below steps to solve the issue:

  1. cd pinokio-files/api/dia.git/
  2. source app/env/bin/activate
  3. pip uninstall dac
  4. pip install git+https://github.com/descriptinc/descript-audio-codec.git
    Learned from https://github.com/nari-labs/dia/issues/140
  5. After this close Pinkio and open it again. Then run dia in Pinokio and it should open this time.


Usage

  1. You can type text in "Text to Generate" box and Let it generate. Then play or download the file.
  2. It will automatically show generation total time for ease of use. File is generated in wav format. There is download button at top right corner of play option tool box to download the file.
  3. Use [S1] or [S2] for speech boy or girl options



Browser-use

From https://github.com/browser-use/web-ui We can benefit from browser-use by using AI (including vision models if required) for performing tasks on sites. Browser-use can open a browser and perform actions on sites. Browser-use supports local ollama so we can perform the automation locally without depending on SaaS based AI.

To configure browser-use use:

  1. Install browser-use using pinokio one click install
  2. Ensure local ollama is installed properly and running. Check for qwen3-vl:4b model installation via
    ollama list
  3. In Agent settings tab:
    1. Use "LLM Provder": Ollama with "LLM Model Name": qwen3-vl:4b. Enable use of vision
    2. Use same for Planner LLM provider including vision for planner LLM also
      We are enabling vision and using a model 4b that can fit entirely in 8GB VRAM GPU card. If you have more or less GPU VRAM adjust model accordingly. We dont want to use CPU for vision as that will be too slow.
      Vision is only required for certain tasks. If HTML body of page would be enough for model to work then we can use non-vision models eg gpt-oss:20b which might run slower from CPU compared to GPU
      Open in other browser http://127.0.0.1:7788/ to change settings. For some reason changing model in Pinokio UI does not works unless we close pinokio and open again.
  4. In Browser Settings
    1. Disable "Keep Browser Open"
    2. Enable "Headless mode". We can always see what is visible in browser in the "Run agent" tab and we also get gif under "Task output" once task is complete or stopped.
    3. Disable Browser security
  5. In Run Agent we can give task such as "Open sbarjatiya.com/notes_wiki and search for article on how to configure swap space using file in Linux"
  6. Click "Submit Task" and wait for results
  7. Agent interaction page shows various browser screenshots and next action plan based on what was displayed in browser
  8. Task recording gif at bottom shows the various screens of browser with descriptive overlay text
  9. Overall results are not very impressive. More research is required to get better output from this tool.



Home > Local system based AI tools > Pinokio