Flowise/packages/components/nodes/documentloaders
Ademílson Tonato 572fb31a1c
chore: update Firecrawl version and add FirecrawlExtractTool (#4073)
* chore: update Firecrawl version and add FirecrawlExtractTool

* refactor: update outputs format

* chore: update Firecrawl request headers to include X-Origin and X-Origin-Type

* feat: add FireCrawl testing suite for scraping, crawling, and data extraction

- Introduced FireCrawl-TEST.ts to validate FireCrawlLoader functionality.
- Implemented tests for basic scraping, crawling with text splitting, data extraction, and extract status retrieval.
- Enhanced error handling in FireCrawlLoader for better debugging.

* Update pnpm-lock.yaml

* refactor: FireCrawl API integration to improve parameter handling and error logging

* refactor firecrawl

* Update FireCrawl.ts

removed console log

* Update pnpm-lock.yaml

* Update pnpm-lock.yaml

---------

Co-authored-by: Ong Chung Yau <33013947+chungyau97@users.noreply.github.com>
Co-authored-by: Henry <hzj94@hotmail.com>
Co-authored-by: Henry Heng <henryheng@flowiseai.com>
2025-05-27 14:58:35 +01:00
Name | Last commit | Date
--- | --- | ---
API | Chore/LC v0.3 (#3517) | 2024-11-28 11:06:12 +00:00
Airtable | Enhance Airtable Document Loader with Filter and Text Output (#3074) | 2024-08-25 13:26:39 +01:00
ApifyWebsiteContentCrawler | Chore/LC v0.3 (#3517) | 2024-11-28 11:06:12 +00:00
BraveSearchAPI | Chore/LC v0.3 (#3517) | 2024-11-28 11:06:12 +00:00
Cheerio | Chore/refractor (#4454) | 2025-05-27 07:29:42 +01:00
Confluence | Chore/LC v0.3 (#3517) | 2024-11-28 11:06:12 +00:00
Csv | Chore/refractor (#4454) | 2025-05-27 07:29:42 +01:00
CustomDocumentLoader | Chore/refractor (#4454) | 2025-05-27 07:29:42 +01:00
DocumentStore | Chore/refractor (#4454) | 2025-05-27 07:29:42 +01:00
Docx | Chore/refractor (#4454) | 2025-05-27 07:29:42 +01:00
Epub | Chore/refractor (#4454) | 2025-05-27 07:29:42 +01:00
Figma | Chore/LC v0.3 (#3517) | 2024-11-28 11:06:12 +00:00
File | Chore/refractor (#4454) | 2025-05-27 07:29:42 +01:00
FireCrawl | chore: update Firecrawl version and add FirecrawlExtractTool (#4073) | 2025-05-27 14:58:35 +01:00
Folder | Chore/LC v0.3 (#3517) | 2024-11-28 11:06:12 +00:00
Gitbook | Chore/LC v0.3 (#3517) | 2024-11-28 11:06:12 +00:00
Github | feat: Enterprise Github (#4221) | 2025-04-03 01:17:56 +08:00
Jira | Feature/agentflow v2 (#4298) | 2025-05-10 10:21:26 +08:00
Json | Chore/refractor (#4454) | 2025-05-27 07:29:42 +01:00
Jsonlines | Chore/refractor (#4454) | 2025-05-27 07:29:42 +01:00
Notion | Chore/LC v0.3 (#3517) | 2024-11-28 11:06:12 +00:00
Pdf | Chore/refractor (#4454) | 2025-05-27 07:29:42 +01:00
PlainText | Feature/Ability to omit all metadata keys using asterisk (#2401) | 2024-05-13 16:30:57 +01:00
Playwright | Chore/refractor (#4454) | 2025-05-27 07:29:42 +01:00
Puppeteer | Chore/refractor (#4454) | 2025-05-27 07:29:42 +01:00
S3Directory | [Feature] improve CsvLoader & clean code (#3830) | 2025-01-14 16:47:04 +00:00
S3File | Chore/refractor (#4454) | 2025-05-27 07:29:42 +01:00
SearchApi | Chore/LC v0.3 (#3517) | 2024-11-28 11:06:12 +00:00
SerpApi | Chore/LC v0.3 (#3517) | 2024-11-28 11:06:12 +00:00
Spider | Chore/LC v0.3 (#3517) | 2024-11-28 11:06:12 +00:00
Text | Chore/refractor (#4454) | 2025-05-27 07:29:42 +01:00
Unstructured | Chore/refractor (#4454) | 2025-05-27 07:29:42 +01:00
VectorStoreToDocument | Bugfix/Missing Filter for VectorStore to Document (#2285) | 2024-04-29 22:25:40 +01:00